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Res9arch  tnftlatlon  Program  -  1987 
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AFOSR  has  provided  funding  for  follow-on  research  efforts  f-or  the 
participants  In  the  Summer  Faculty  Research  Program.  Initially  this 
program  was  conducted  by  AFOSR  and  popularly  known  as  the  Mini -Grant 
Program.  Since  1983  the  program  has  been  conducted  by  the  Summer  Faculty 
Research  Program  (SFRP)  contractor  and  Is  now  called  the  Research 
Initiation  Program  (RIP).  Funding  Is  provided  to  establish  RIP  awards  to 
about  half  the  number  of  participants  In  the  SFRP. 

Participants  In  the  1987  SFRP  competed  for  funding  under  the  1987 
RIP.  Participants  submitted  cost  and  technical  proposals  to  the. 
contractor  by  1  November  1987,  following  their  participation  In  the  1987 
SFRP. 


Evaluation  of  these  proposals  was  made  by  the  contractor. 
Evaluation  criteria  consisted  of: 

1.  Technical  Excellence  of  the  proposal 

2.  Continuation  of  the  SFRP  effort 

3.  '  Cost  sharing  by  the  University 

The  list  of  proposals  selected  for  award  was  forwarded  to  AFOSR  for 
approval  of  funding.  Those  approved  by  AFOSR  were  funded  for  research 
efforts  to  be  completed  by  31  December  1988. 

The  following  summarizes  the  events  for  the  evaluation  of  proposals 
and  award  of  funding  under  the  RIP. 

A.  Rip  proposals  were  submitted  to  the  contractor  by 

1  November  1987.  The  proposals  were  limited  to  $20,000  plus 
cost  Sharing  by  the  universities.  The  universities  were 
encouraged  to  cost  share  since  this  Is  an  effort  to 
establish  a  long  term  effort  between  the  Air  Force  and  the 
university. 

8.  Proposals  were  evaluated  on  the  criteria  listed  above  and 

the  final  award  approval  was  given  by  AFOSR  after 

consultation  with  the  Air  Force  Laboratories. 

C.  Subcontracts  were  negotiated  with  the  universities.  The 
period  of  performance  of  the  subcontract  was  between 
October  1987  and  December  1988. 

i 

Copies  of  the  Final  Reports  are  presented  In  Volumes  I  through  III 
•f  the  1987  Research  Initiation  Program  Report.  There  were  a  total  of  83 
IIP  awards  made  under  the  1987  program. 
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STATISTICS 


Total  SFRP  Participants  159 
Total  RIP  Proposals  submitted  by  SFRP 
Total  RIP  Proposals  submitted  by  GSRP 
Total  RIP  Proposals  submitted  124 

Total  RIP'S  funded  to  SFRP  81 

Total  RIP'S  funded  to  GSRP  2 

Total  RIP'S  funded  83 

Total  RIP'S  Proposals  submitted  by  HBCU' 
Total  RIP'S  Proposals  funded  to  HBCU's 


Laboratory 

SFRP 

Particioants 

RIP'S 

Submitted 

RIP'S 

Funded 

AAMRL 

13 

12  (2  GSRP) 

6 

AFWAL/APL 

8 

6 

4 

ATL 

9 

8  (1  GSRP) 

6 

AEOC 

6 

4 

3 

AFWAL/AL 

9 

9  (1  GSRP) 

5 

LC 

1 

1 

1 

ESMC 

1 

0 

0 

ESD 

1 

1 

0 

ESC 

8 

8 

6 

AFWAL/FDL 

9 

8  (1  GSRP) 

6  (1  GSRP) 

FJSRL 

9 

5 

5 

AFGL 

13 

10  (1  GSRP) 

7 

HRL/OT 

2 

2 

1 

HRL/LR 

3 

3 

2 

HRL/MO 

3 

2 

2 

HRL/IO 

0 

0 

0 

LHC 

3 

1 

0 

AFWAL/HL 

13 

11  (1  GSRP) 

6  (1  GSRP) 

OEHL 

5 

3 

3 

AL 

7 

4 

3 

RADC 

11 

10 

7 

SAM 

16 

8 

6 

OEOMI 

2 

2 

0 

WL 

7 

6 

4 

Total 


159 


124 


83 


LIST  OF  UNIVERSITY  THAT  PARTICIPATED 


Adelphi  University  -  l 
Alabama,  University  of  -  l 
Alaska-Fairbanks,  Univ.  of  -  l 
Alfred  University  -  l 
Arizona  State  University  -  1 
Arkansas  State  University  -  1 
Arkansas,  University  of  -  1 
Auburn  University  -  l 
Bishop  College  -  i 
Capital  University  -  1 
Catholic  Univ.  of  America  -  1 
Cedarville  College  -  1 
Central  State  University  -  1 
Cincinnati,  University  of  -  5 
Colorado,  University  of  -  2 
Dayton,  University  of  -  7 
Dillard  University  -  1 
Drury  College  -  l 
Eastern  Illinois  University  -  1 
Eastern  Kentucky  University  -  1 
Eastern  New  Mexico  University  -  2 
Fairfield  University  -  1 
Florida  A&M  University  -  1 
Florida,  University  of  -  2 
Fort  Lewis  College  -  1 
Gonzaga  University  -  1 
Grambling  State  University  -  1 
Hampton  University  -  1 
Houston,  University  of  -  2 
Howard  University  -  i 
Idaho,  University  of  -  1 
Illinois-Chicago,  Univ.  of  -  2 
Indiana  University  -  1 
Indiana  Univ.  of  Pennsylvania  -  1 
Iowa,  University  of  -  1 
Jackson  State  University  -  1 
Jarvis  Christian  College  -  1 
Jesm  Baromedical  Res.  Inst.  -  1 
John  Hopkins  Evening  College  -  1 
Kansas  State  University  -  1 
Kansas,  University  of  -  1 
Kentucky,  University  of  -  1 


Meharry  Medical  College  -  1 
Memphis  State  University  -  1 
Metropolitan  State  College  -  1 
Michigan  State  University  -  1 
Mississippi  State  University  -  4 
Mississippi,  University  of  -  1 
Missouri-Kansas  City,  Univ.  -  1 
Missouri -Rol la ,  Univ.  of  ~  3 
Montana,  University  of  -  1 
Montclair  State  College  -  1 
Morehouse  College  -  1 
Nazareth  College  -  1 
Nebraska-Lincoln,  Univ.  of  -  2 
New  Mexico  State  University  -  1 
New  York  State,  Univ.  of  -  3 
N.  Carolina  A&T  State  Univ.  -  1 
N.  Carol ina-Greensboro,  Univ  -  1 
Northwestern  University  -  1 
Ohio  State  University  -  5 
Ohio  University  -  2 
Oklahoma  State  University  -  1 
Oregon  Institute  of  Tech.  -  1 
Oregon  State  University  -  1 
Ouachita  Baptist  University  -  1 
Pace  University  -  1 
Pennsylvania  State  Univ.  -  1 
Point  Loma  College  -  1 
Puerto  Rico-Mayaguez,  Univ.  -  l 
Purdue  University  -  1 
Rochester  Inst,  of  Tech.  -  1 
Rose-Hulman  Inst,  of  Tech.  -  2 
Saint  Paul ' s  College  -  1 
San  Francisco  State  Univ.  -  1 
South  Dakota  State  Univ.  -  1 
South  Florida,  University  of  -  2 
Southeastern  Mass.  Univ.  -  2 
Southern  Illinois  University  -  2 
Southern  Mississippi,  Univ.  -  1 
Southern  University  -  2 
St.  Louis  University  -  1 
St.  Mary's  University  -  1 
Talladega  College  -  1 
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LIST  OF  UNIVERSITY  THAT  PARTICIPATED 
Continued 


Lock  Haven  Univ.  of  Pennsyl.  -  1 
Long  Island  University  -  1 
Louisiana  State  University  -  1 
Louisiana  Tech.  University  -  1 
Lowell,  University  of  -  4 
Texas  Southern  University  -  3 
Texas  Technical  University  -  2 
Texas-Austin,  University  of  -  1 
Tuskegee  University  -  1 
Utah  State  University  -  l 
Walla  Walla  College  -  1 
Washington  State  University  -  1 
West  Florida,  University  of  -  1 
Western  Michigan  University  -  3 


Taylor  University  -  1 
Temple  University  -  1 
Tennessee  Technical  Univ.  -  1 
Tennessee,  University  of  -  1 
Texas  A&M  University  -  2 
Wichita  State  University  -  2 
Wilberforce  University  -  1 
Wisconsin-Eau  Claire  Univ.  -  2 
Wisconsin-Madison,  Univ.  of  -  1 
Wisconsin-Whitewater,  Univ.  -  1 
Wittenberg  University  -  1 
Worchester  Polytech.  Inst.  -  2 
Wright  State  University  -  3 
Xavier  University  -  1 
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PARTICIPANT  LABORATORY  ASSIGNMENT  (Page  1) 


AERO  PROPULSION  LABORATORY 
(Wright-Patterson  Air  Force  Base) 

Or.  Suresh  K.  Aggarwal 
Univ.  of  Illinois  at  Chicago 
Specialty:  Aerospace  Engineering 

Or.  Bryan  R.  Becker 
Rose-Hulman  Institute 
Specialty:  Engineering  Science 


Dr.  Richard  Tankin 
Northwestern  University 
Specialty:  Mechanical  Engineering 

Or.  Cheng-Hsiao  Wu 

Univ.  of  Missouri 

Specialty:  Solid  State  Physics 


ARMAMENT  LABORATORY 
(Eglin  Air  Force  Base) 


Or.  Charles  Bell 
Arkansas  State  University 
Specialty:  Mechanical  Engineering 

Or.  Robert  W.  Courter 
Louisiana  State  University 
Specialty: Aerospace  Engineering 

Or.  Joseph  J.  Feeley 
University  of  Idaho 
$pecialtv:Electrical  Engineering 


Or.  Elmer  C.  Hansen 
University  of  Florida 
Specialty: Mechanical  Engineering 

Or.  James  Hoffmaster 
Gonzaga  University 
Specialty: Physics 

Dr.  James  Nail 
Mississippi  State  Univ. 
Specialty: Electrical  Engineering 


Ms.  Jennifer  L.  Davidson  (1986),  (GSRP)  Dr.  Meckinley  Scott  (1986) 
University  of  Florida  University  of  Alabama 

Specialty:  Mathematics  Specialty:  Statistics 


Dr.  Mo  Samimy  (1986) 

Ohio  State  University 
Specialty:  Mechanical  Engineering 


Mr.  Jim  S.  Sirkis  (1986) , (GSRP) 
University  of  Florida 
Specialty:  Engineering  Mechanics 


HARRY  G.  ARMSTRONG  AEROSPACE  MEDICAL  RESEARCH  LABORATORY 
(Wright-Patterson  Air  Force  Base) 


Dr.  Praphulla  K.  Bajpai 
University  of  Oayton 
Specialty:  Immunology 

Dr.  Gwendolyn  Howze 
Texas  Southern  University 
Specialty:  Physics 


Or.  Thomas  Nygren 
Ohio  State  University 
Specialty:  Psychology 

Dr.  Donald  Robertson 
Indiana  University  of  PA 
Specialty:  Psychology 
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PARTICIPANT  LABORATORY  ASSIGNMENT  (Page  2) 


HARRY  G.  ARMSTRONG  AEROSPACE  MEDICAL  RESEARCH  LABORATORY 
(Wright-Patterson  Air  Force  Base) 

(continued) 

Or.  Noel  Nussbaum  Dr.  John  Westerkamp 

Wright  State  University  University  of  Dayton 

Specialty:  Biology  Specialty:  Electrical  Engineering 

Dr.  Jacqueline  Payer  (1986) 

Duke  Uniyersity 

Specialty:  Biomechanical  Engineering 


ARNOLD  ENGINEERING  DEVELOPMENT  CENTER 
(Arnold  Air  Force  Systems) 

Dr.  Suhrit  K.  Oey 

Eastern  Illinois  University 

Specialty:  Aerospace  Engineering 

Dr.  William  M.  Grissom 

Morehouse  College 

Specialty:  Mechanical  Engineering 


ASTRONAUTICS  LABORATORY 
(Edwards  Air  Force  Base) 

Or.  Gurbux  S.  Alag 

Western  Michigan  University 

Specialty:  Systems  Engineering 

Or.  John  Kenney 

Eastern  New  Mexico  University 

Specialty:  Physical  Chemistry 


AVIONICS  LABORATORY 
(Wright-Patterson  Air  Force  Base) 

Dr.  Vernon  L.  Bakke 
University  of  Arkansas 
Specialty:  Mathematics 


Dr.  Surgounda  Patil 
Tennessee  Technical  University 
Specialty:  Math  Statistics 


Or.  Lawrence  Schovanec 
Texas  Tech  University 
Specialty:  Mathematics 


Dr.  Narayan  C.  Haider 
University  of  South  Florida 
Specialty:  Physics 
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PARTICIPANT  LABORATORY  ASSIGNMENT  (Page  3) 


AVIONICS  LABORATORY 
(Wright-Patterson  Air  Force  Base) 
(continued) 

Prof .  William  K.  Curry 
Rose-Hulman  Inst,  of  Technology 
Specialty:  Computer  Science 

Dr.  Verlynda  S.  Dobbs 
Wright  State  University 
Specialty:  Computer  Science 

Dr.  George  W.  Zobrist  (1986) 
University  of  Missouri-Rol la 
Specialty:  Electrical  Engineering 


ENGINEERING  AND  SERVICES  CENTER 
(Tyndall  Air  Force  Base) 

Dr.  William  W.  Bannister 
University  of  Lowell 
Specialty:  Organic  Chemistry 

Or.  William  M.  Bass 

The  University  of  Tennessee 

Specialty:  Physical  Anthropology 

Dr.  Peter  Jeffers 
S.U.N.Y. 

Specialty:  Chemistry 

Or.  William  T.  Cooper  (1986) 
Florida  State  University 
Specialty:  Chemistry 


FLIGHT  DYNAMICS  LABORATORY 
(Wright-Patterson  Air  Force  Base) 

Mr.  Thomas  Enneking  (GSRP) 
University  of  Notre  Dame 
Specialty:  Civil  Engineering 


Dr.  Alastair  McAulay 
Wright  State  University 
Specialty:  Electrical  Engineering 

Or.  John  Y.  Cheung  (1986) 
University  of  Oklahoma 
Specialty:  Electrical  Engineering 


Dr.  William  Schulz 
Eastern  Kentucky  University 
Specialty:  Chemistry 

Dr.  Joseph  Tedesco 
Auburn  University 
Specialty:  Civil  Engineering 

Dr.  Dennis  Truax 
Mississippi  State  Univeristy 
Specialty:  Civil  Engineering 

Dr.  Yong  S.  Kim  (1986) 

The  Catholic  Univ.  of  America 
Specialty:  Civil  Engineering 


Dr.  Gary  Slater 
University  of  Cincinnati 
Specialty:  Aerospace  Engineering 
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FLIGHT  DYNAMICS  LABORATORY 
(Wright-Patterson  Air  Force  Base) 
(continued) 

Dr.  Oliver  McGee 
Ohio  State  University 
Specialty:  Engineering  Mechanics 

Or.  Shiva  Singh 
Univ.  of  Kentucky 
Specialty:  Mathematics 

Dr.  George  R.  Doyle  (1986) 
University  of  Dayton 
Specialty:  Mechanical  Engineering 


Or.  Forrest  Thomas 
University  of  Montana 
Specialty:  Chemistry 

Dr.  William  Wolfe 
Ohio  State  University 
Specialty:  Engineering 

Dr.  V.  Oakshina  Murty  (1986) 
University  of  Portland 
Specialty:  Engineering  Mechanics 


Or.  Tsun-wai  G.  Yip  (1986) 

Ohio  State  University 

Specialty:  Aeronautics-Astronautics  Engineering 


FRANK  J.  SEILER  RESEARCH  RESEARCH  LABORATORY 
(United  State  Air  Force  Academy) 


Dr.  Charles  M.  Bump 
Hampton  University 
Specialty:  Organic  Chemistry 

Or.  Stephen  J.  Gold 

South  Dakota  State  University 

Specialty:  Electrical  Engineering 

Or.  Henry  Kurtz 
Memphis  State  Univ. 

Specialty:  Chemistry 


GEOPHYSICS  LABORATORY 
(Hanscom  Air  Force  Base) 

Dr.  Lee  A.  Flippin 

San  Francisco  State  Univ. 

Specialty:  Organic  Chemistry 


Dr.  Howard  Thompson 

Purdue  University 

Specialty:  Mechanical  Engineering 

Or.  Melvin  Zandler 
Wichita  State  Univ. 

Specialty:  Physical  Chemistry 


Dr.  Gandikota  Rao 
St.  Louis  Uniyersity 
Specialty:  Meteorology 
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GEOPHYSICS  LABORATORY 
(Hanscom  Air  Force  Base) 
(continued) 

Or.  Mayer  Humi 
WPI 

Specialty:  Applied  Mathematics 

Or.  Steven  Leon 
Southeastern  Massachusettes 
Specialty:  Mathematics 

Dr.  Henry  Nebel 
Alfred  University 
Specialty:  Physics 


Or.  Timothy  Su 

Southeastern  Massachusetts  Univ. 
Specialty:  Physical  Chemistry 

Dr.  Keith  Walker 
Point  Loma  College 
Specialty:  Physics 


HUMAN  RESOURCES  LABORATORY 

(Brooks,  Williams  and  Wright-Patterson  Air  Force  Base) 


Or.  Patricia  A.  Carlson 
Rose-Hulman  Inst,  of  Technology 
Specialty:  Literature/Language 

Or.  Ronna  E.  Dillon 
Southern  Illinois  University 
Specialty:  Educational  Psychology 

Or.  Michael  Matthews 
Drury  College 
Specialty:  Psychology 


Dr.  John  Uhlarik 
Kansas  State  University 
Specialty:  Psychology 

Dr.  Charles  Wells 
University  of  Dayton 
Specialty:  Management  Science 

Dr.  Charles  Lance  (1986) 
University  of  Georgia 
Specialty:  Psychology 


Dr.  Stephen  Loy  (1986) 

Iowa  State  University 

Specialty:  Management  Information  Sys. 


Dr.  Jorge  Mendoza 
Texas  A&M  University 
Specialty:  Psychology 


Dr.  Doris  Walker-Oalhouse  (1986) 
Jackson  State  University 
Specialty:  Reading  Education 


Dr.  Billy  Wooten  (1986) 

Brown  University 

Specialty:  Philosophy,  Psychology 


PARTICIPANT  LABORATORY  ASSIGNMENT  (Page  6) 


LOGISTICS  COMMAND 
(Wright-Patterson  Air  Force  Base) 

Or.  Howard  Weiss 

Specialty:  Industrial  Engineering 
Temple  University 


MATERIALS  LABORATORY 
(Wright-Patterson  Air  Force  Base) 

Dr.  Bruce  A.  OeVantier 
S.  Illinois  University 
Specialty:  Civil  Engineering 

Or.  Ravinder  Oiwan 
Southern  University 
Specialty:  Metallurgy 

Or.  Bruce  A.  Craver 
University  of  Oayton 
Specialty;  Physics 

Or.  Robert  Patsiga  (19B6) 

Indiana  Univ,  of  Pennsylvania 
Specialty:  Organic  Polymer  Chemistry 

Or.  Gopal  M.  Mehrotra  (1986) 

Wright  State  University 
Specialty;  Metallurgy 


Dr.  John  W.  Gilmer 
Penn  State  University 
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ABSTRACT 

A  detailed  numerical  study  of  the  aerothermal  mechanics  within  a 
short  straight  section  of  a  turbine  blade  internal  cooling  passage 
with  a  single  turbulator  is  described.  The  two  dimensional, 
transient,  Reynolds  averaged  Navier  Stokes,  continuity  and  energy 
equations  are  iterated  to  a  steady  state  solution  using  the 
MacCormack  explicit  predictor-corrector  algorithm.  Turbulence 
closure  is  achieved  through  the  use  of  the  Baldwin-Lomax  algebraic 
two  layer  eddy  viscosity  model. 

Axial  distributions  of  local  skin  friction  coefficient,  local 
Stanton  number  and  local  Nusselt  number  are  given.  Profiles  of 
velocity  and  temperature  are  presented  as  well  as  contours  of 
streamfunction  and  temperature  which  display  complex  details  of  the 
flow  structure.  It  was  found  that  the  widely  used  Reynolds  Analogy 
greatly  underpredicts  the  heat  transfer  rate  as  given  by  a  direct 
calculation  using  Fourier's  law.  Finally,  the  numerical  results 
are  found  to  compare  favorably  to  the  experimental  results 
published  by  Han  et  al  (1986). 
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I.  INTRODUCTION 


Advanced  gas  turbine  engines  will  be  designed  for  higher  thermal 
efficiency  and  specific  thrust  which  will  necessitate  higher 
operating  speeds,  temperatures,  and  pressures.  As  a  consequence, 
the  heat  loads  on  the  turbine  blades  will  increase,  which  can 
result  in  both  high  metal  temperatures  and  thermal  stress 
concentrations  due  to  severe  thermal  gradients.  To  reduce  these 
effects,  cooling  air  is  extracted  from  the  compressor  and  is  routed 
through  small  rectangular  cooling  passages  within  the  turbine 
blades. 

As  shown  in  Figure  1,  the  cooling  air  enters  the  root  of  each 
turbine  blade  and  flows  through  complex,  serpentine  passages  until 
it  exits  through  the  blade's  surface  into  the  cavity  surrounding 
the  turbine  rotor-strator .  Since  the  pressure  drop,  and  thus  the 
mass  flowrate,  along  these  cooling  passages  is  limited  by  the 
pressure  differential  between  the  compressor  and  the  turbine  stage, 
methods  other  than  increased  coolant  flow,  such  as  the  introduction 
of  turbulators,  must  be  used  to  enhance  the  heat  transfer.  These 
turbulators  or  turbulence  promoters,  which  consist  of  ribs 
protruding  from  the  walls  of  the  internal  cooling  passages, 
increase  the  turbulence  level  within  the  cooling  air  flow  and  thus 
augment  the  rate  of  energy  transfer  from  the  turbine  blade  through 
the  cooling  passage  wall  and  into  the  cooling  air  flow.  Thus,  the 
resulting  coolant  flows  contain  many  complex  phenomena  which  are 
not  adequately  included  in  the  current  heat  transfer  design  of 
turbine  blades.  The  detailed  modeling  of  these  flows  is  essential 
to  the  accurate  prediction  of  the  distribution  of  the  local  heat 
transfer  coefficient  along  the  cooling  passages.  Such  prediction 
is  necessary  to  achieve  a  thermal  design  of  these  components  which 
incorporates  efficient  cooling  and  thereby  alleviates  the  effects 
of  the  increased  heat  load. 
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There  have  been  numerous  experimental  and  analytical  studies  of 
the  flow  field  and  heat  transfer  within  straight  rectangular  ducts 
which  simulate  the  flow  within  the  internal  cooling  passages  inside 
a  turbine  blade.  However,  there  remains  work  to  be  done  in  the 
detailed  modeling  of  the  heat  transfer  and  fluid  flow  processes 
very  near  the  fluid-solid  interface.  This  paper  reports  on  a  two 
dimensional  modeling  effort  which  begins  to  address  this  need. 

During  my  tenure  as  a  Summer  Research  Fellow  in  the  1987  Summer 
Faculty  Research  Program  (SFRP),  I  began  modeling  of  the  fluid  flow 
and  heat  transfer  in  a  straight  rectangular  passage  with  a  single 
turbulence  promoter.  This  work  was  completed  and  a  technical  paper 
was  written  during  the  Spring  1988  term  with  funding  by  the  1987 
Mini  Grant  Program.  This  paper,  "A  Numerical  Simulation  of  the 
Flow  Field  and  Heat  Transfer  in  a  Rectangular  Duct  with  a 
Turbulator  Using  a  Predictor-Corrector  Algorithm,”  (AIAA-88-2903) , 
was  presented  at  the  AIAA/ASME/SAE/ASEE  24th  Joint  Propulsion 
Conference,  July  11-13,  1988  in  Boston,  MA  (11. 

This  first  objective  of  this  1987  RIP  research  effort  was  to 
modify  the  numerical  model  developed  during  the  summer  of  1987  by 
incorporating  true  symmetry  boundary  conditions  at  the  upper 
boundary  of  the  computational  mesh: 

ff"  =  0  (1) 

for  all  dependent  variables  .  The  resulting  numerical  model  was 
then  iterated  to  a  well  converged  steady  state  solution. 

The  second  objective  was  to  develop  a  post  processor  program  to 

read  and  analyze  the  converged  solution.  After  reading  the 

solution  from  an  online  data  file,  this  post  processor  first 

calculates  the  L2-Norm  of  the  error  in  the  solution.  It  then 

generates  a  plot  data  set  which  contains  the  information  necessary 

to  plot  nondimensional  velocity  and  temperature  profiles.  It  then 

calculates  the  axial  distributions  of  the  local  skin  friction, 

local  Stanton  Number  (via  both  Reynold's  Analogy  and  Fourier's 

Law),  as  well  as  the  local  Nusselt  Number  scaled  by  Me Adams -Dittus- 

Boelter  correlation  for  fully  developed  Nusselt  Number  in 
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rectangular  ducts.  Following  these  calculations,  the  post 
processor  appends  the  experimental  data  reported  by  J.  C.  Han  et  al 
[2J  and  then  generates  data  sets  for  plotting  all  of  the  above 
mentioned  quantities  as  a  function  of  distance  along  the  duct. 

This  program  then  integrates  both  the  calculated  and  the 
experimental  Nusselt  numbers  to  determine  axial  average  values  of 
these  quantities.  Finally,  the  post  processor  integrates  the 
velocity  distribution  throughout  the  computational  mesh  so  as  to 
evaluate  the  streamfunction.  It  then  writes  a  third  plot  data  set 
containing  the  information  necessary  for  plotting  contours  of 
streamfunction  and  nondimensional  temperature. 

The  third  and  final  objective  of  the  1937  RIP  research  period  was 
to  begin  work  on  a  numerical  model  of  a  rectangular  passage  with 
four  turbulators.  The  computational  grid  shown  in  Figures  2  and  3 
was  generated  for  this  model.  Figure  2  shows  the  entire 
computational  domain  containing  an  entrance  region,  four 
turbulators,  and  a  downstream  region.  Figure  3  shows  details  of 
the  first  ten  rib  heights  of  the  mesh.  It  can  be  seen  that  the 
grid  is  very  fine  near  the  lower  boundary  and  in  the  vicinity  of 
the  top  of  the  rib.  The  MacCormack  Predictor-Corrector  algorithm 
was  modified  to  account  for  the  boundaries  of  the  four  turbulator 
surfaces.  Every  effort  was  made  to  accurately  model  the  exact 
conditions  reported  by  Han  et  al  (2).  At  the  close  of  the  1987 
RIP  research  period  this  model  is  still  in  the  "development  and 
check-out"  phase.  It  is  intended  that  this  work  will  be  continued 
at  ray  university  with  additional  funding  from  the  Air  Force  Office 
of  Scientific  Research. 
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IMPROVED  SINGLE  TaRBULATOR  MODE 


As  stated  in  Section  II,  the  first  objective  of  this  summer's 
research  was  to  improve  the  single  turbulator  model  which  was 
developed  during  the  1987  SFRP  research  period.  This  imporved 
model  was  then  iterated  to  a  well  converged  solution  which  was 
analyzed  by  the  post  processor  developed  during  this  1987  RIP 
research  period.  The  computational  mesh  used  in  the  single 
turbulator  model  is  shown  in  Figure  4. 

A.  Convergence 

Iteration  towards  the  improved  solution  was  begun  by  using  the 
earlier  solution  as  an  initial  condition.  The  previous  solution 
was  achieved  after  100,000  iterations  and  then  an  additional 
100,000  iterations  were  performed  with  the  improved  single 
turbulator  model.  The  convergence  of  the  predictor-corrector 
algorithm  is  measured  by  the  L-2  norm  of  the  change  in  the 
normalized  global  solution  vector.  Typically,  in  the  iteration  to 
a  converged  steady  state  solution,  the  value  of  this  norm  will  drop 
3  orders  of  magnitude  from  the  initial  conditions  and  then 
oscillate  about  this  reduced  value,  indicating  that  convergence  has 
been  achieved  [3],  In  the  earlier  simulation,  the  value  of  this 
norm  decreased  from  7.137  at  the  outset  to  0.002662  after  50,000 
iterations  and  oscillated  near  that  value  for  the  next  50,000 
iterations.  After  the  symmetry  boundary  condition  was  improved, 
the  solution  was  restarted  and  the  norm  dropped  to  less  than  0.0008 
within  the  first  50,000  iterations  and  continued  to  oscillate  just 
below  that  value  for  the  final  50,000  iterations.  This  behavior  is 
typical  of  convergence  to  the  steady  state  solution. 

B.  Streamfunction  Contours 

Streamfunction  contours  over  the  entire  computational  domain  are 

shown  in  Figure  5.  As  shown  in  this  figure,  counterclockwise 

vortices  are  present  both  upstream  and  downstream  of  the  turbulator 

rib.  Reattachment  of  the  large  separated  flow  behind  the 

54-5 


turbulator  occurs  approximately  nine  to  ten  rib  heights  downstream 
of  the  rib.  Figure  6  shows  the  size  of  the  counterrotating  vortex 
upstream  of  the  rib.  Figure  7  gives  detail  of  the  flow  structure 
atop  the  turbulator  and  shows  the  presence  of  a  long  thin 
counterrotating  vortex  centered  over  the  rib.  Detail  of  the  large 
counterclocicvise  recirculation  behind  the  turbulator  is  shown  in 
Figure  8.  The  dividing  streamline  departs  from  the  rib  near  the 
top  corner.  The  small  bubble  located  at  the  point  where  the  rib 
rejoins  the  horizontal  surface  indicates  the  presence  of  a  small 
clockwise  rotating  vortex  at  that  juncture  which  is  shown  in  Figure 
9. 


C.  Temperature  Contours 

The  flow  field  temperature  distribution  is  shown  in  Figure  10. 
Upstream  of  the  turbulator,  very  little  mixing  is  occurring  as 
indicated  by  the  concentration  of  temperature  contours  near  the 
horizontal  surface.  The  bulk  of  the  approaching  fluid  is  slightly 
warmer  than  the  free  stream  value,  T«  ,  as  indicated  by  the 
uppermost  contour  which  attaches  at  the  front  corner  of  the  rib. 
This  slight  warming  is  due  to  heat  conduction  from  the  hot 
isothermal  wall.  Downstream  of  the  rib,  the  turbulent  mixing  of 
the  fluid  has  greatly  diffused  the  thermal  energy  upward,  well 
beyond  the  first  rib  height.  The  thickness  of  this  warm  air 
region  grows  downstream  of  the  turbulator.  Just  behind  the  rib, 
the  temperature  distribution  shown  in  Figure  11,  reflects  the 
effects  of  the  complex  flow  structure  which  exists  there. 

0.  Velocity  and  Temperature  Profiles 

Figures  12  through  15  present  profiles  of  u/u^^  and  T/T« 
plotted  against  Y/YHAX  at  9  streamwise  locations,  X/H,  where  H  is 
the  rib  height.  The  profiles  at  X/H  =  0.0  represent  the  specified 
inflow  boundary  conditions. 

At  X/H  =  3.0  and  4.0,  the  profiles  display  the  development  of  the 
momentum  and  thermal  boundary  layers.  At  X/H  =  4.0,  the  horizontal 
velocity  is  negative  near  the  solid  wall  which  indicates  the 

presence  of  the  upstream  recirculation  zone  shown  in  Figure  6. 
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The  locations/  X/H  »  5.0,  5.5,  and  6.0,  are  at  the  front  top 
corner,  nidline  and  back  top  corner  of  the  rib,  respectively.  At 
X/H  s  5.0,  the  horizontal  velocity  has  increased  dramatically  near 
the  solid  vail  as  the  flow  accelerates  over  the  rib  which  results 
in  a  corresponding  decrease  in  flow  field  temperature  since  the 
total  energy  of  the  flow  must  be  preserved.  By  X/H  «  5.5,  the  flow 
has  begun  to  decelerate  with  a  corresponding  increase  in 
temperature.  At  X/H  «  6.0  these  trends  continue. 

The  streamvise  locations,  X/H  «  7.0  and  10.0,  are  located  at  the 
back  bottom  corner  of  the  rib  and  at  the  midline  of  the  large 
recirculation  bubble  which  exists  downwind  of  the  rib.  The 
temperature  profiles  show  that  the  turbulent  mixing  behind  the  rib 
has  greatly  diffused  the  energy  from  the  hot  solid  wall  to  form  a 
warm  air  region  which  extends  more  than  1.0  rib  height  vertically 
from  the  solid  wall.  The  horizontal  velocity  profile  is  typical  of 
a  recirculation  zone  with  a  negative  upstream  flow  near  the  wall 
changing  to  a  positive  downstream  flow  higher  in  the  flow  field. 

The  profiles  at  X/H  «  20.0,  demonstrate  that  both  the  momentum 
and  thermal  boundary  layers  are  reestablished  downstream  of  the 
recirculation  bubble.  However,  comparison  of  these  profiles  to 
those  upstream  of  the  turbulence  promoter,  show  that  the  downstream 
boundary  layers  are  much  thicker  and  well  mixed,  illustrating  the 
effects  of  the  intense  turbulent  mixing  caused  by  the  flow  over  the 
turbulence  promoter. 


E.  Skin  Friction  and  Stantoa  Kumber 
The  axial  distribution  of  the  local  skin  friction,  shown  in 
Figure  16,  was  calculated  from  the  following  equation: 

rail  _ 

Jp  u  * 

*^m  m 


1 1  m 


The  normal  derivative  of  the  horizontal  velocity  was  evaluated  at 
the  wall  using  a  second  order  accurate  three  point  difference 
expression.  The  viscosity  was  determined  as  a  function  of 
temperature  at  the  wall.  As  shown  in  Figure  16,  the  skin  friction 
achieves  a  maximum  at  approximately  X/H  *  5.0  which  corresponds  to 
the  very  steep  velocity  gradient  near  the  wall  as  the  flow 
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accelerates  over  the  rib.  As  mentioned  earlier,  this  reduces  the 
temperature  of  the  fluid  near  the  wall  which  results  in  a  steep 
thermal  gradient  there  and  the  high  heat  transfer  rate  displayed  in 
Figures  17  and  18. 

The  axial  distribution  of  the  local  Stanton  Number,  given  in 
Figure  17,  was  evaluated  from  the  following  equation  using  two 
different  methods: 

"  c  P  u  (T 
P  -  -'  wall 

In  the  first  method,  the  Reynolds  Analogy  was  used,  in  which  the 
heat  transfer  rate  is  related  to  the  skin  friction  as  follows: 

Stanton  Number  * 

This  method  is  widely  used  in  turbomachinery  calculations  but  these 
calculations  greatly  underpredict  the  heat  transfer  rates  measured 
experimentally. 

In  the  second  method,  Fourier's  Conduction  Law  was  used  to 
evaluate  the  heat  transfer  rate  at  the  wall,  Qwmii,  which  appears 
in  Equation  ( 3] : 

^all  " 

^  ^  (5) 

In  this  calculation,  the  normal  derivative  of  temperature  at  the 
wall  was  calculated  in  a  manner  similar  to  that  used  fox  the  normal 
derivative  of  the  horizontal  velocity  discussed  above  and  the 
thermal  conductivity  was  evaluated  as  a  function  of  temperature  at 
the  wall. 

In  the  convection  heat  transfer  process,  the  energy  is  first 
transferred  by  conduction  from  the  solid  wall  to  the  fluid 
particles  adjacent  to  the  wall.  These  fluid  particles  are  then 
transported  away  from  the  solid  wall  by  the  bulk  mixing  of  the 
fluid  which  results  in  the  diffusion  of  energy  from  the  solid  wall. 
The  basis  for  the  Reynolds  Analogy  is  the  assumption  that  the  same 
mechanism  is  responsible  for  the  exchange  of  both  heat  and 
momentum.  This  assumption  disregards  the  details  of  the  energy 
transfer  process  at  the  solid  wall-fluid  interface.  As  shown  in 
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Figure  17,  the  direct  calculation  o£  the  heat  transfer  rates  via 
Fourier's  Law  predicts  heat  transfer  rates  which  can  be  as  great  as 
3  times  that  given  by  the  Reynolds  Analogy.  This  discrepancy  can 
be  explained  by  the  fact  that  the  Reynolds  Analogy  is  derived  for 
laminar  or  fully  turbulent  flows  over  a  flat  plate  with  laminar  and 
turbulent  Prandtl  number,  Pr  »  pr*  >  1.0.  Thus,  the  Reynolds 
Analogy  is  not  applicable  to  flows  with  recirculation  zones  and 
flow  reversals  which  cause  the  normal  gradient  of  velocity  to 
differ  greatly  from  the  normal  gradient  of  temperature. 


F.  Comparison  of  Numerical  to  Experimental  Results 

In  an  effort  to  validate  the  numerical  algorithm,  the  numerical 
results  were  compared  to  the  experimental  results  published  by  Han 
et  al  [21.  In  the  work  by  Han  et  al,  parametric  studies  of  the 
heat  transfer  and  friction  characteristics  in  rectangular  channels 
with  turbulators  were  performed.  The  parameters  of  the  test  case 
which  is  most  similar  to  the  numerical  simulation  are  as  follows: 
Reo  =  60,000 

channel  width/channel  height  *  4 
rib  angle  of  attack  =  90® 
rib  pitch/rib  height  *  20 

For  this  test  case,  Han  et  al  report  that  reattachment  occurs 
about  two  rib  heights  downstream  from  separation.  As  discussed 
earlier,  the  numerical  results  predict  reattachment  nine  to  ten  rib 
heights  downstream  of  the  rib.  This  great  disparity  is  a  result  of 
the  difference  in  rib  shapes.  As  shown  in  Fighre  18,  the 
experimental  rib  is  square  in  cross  section  and  much  smaller  than 
the  'flatten  sin  curve”  used  in  the  numerical  model.  The  gentle 
slope  of  the  rib  in  the  numerical  solution  permits  the  flow 
downstream  of  the  rib  to  straighten  more  gently,  thus  preserving 
its  horizontal  momentum  and  lengthening  the  recirculation  zone. 

In  Figure  18,  the  calculated  axial  distribution  of  the  Nusselt 
Number  is  compared  to  experimental  results  reported  by  Han  et  al 
[21.  In  the  calculation  of  the  Nusselt  Number  the  heat  transfer  at 
the  wall  was  evaluated  by  using  Equation  (5). 

In  the  comparison  of  the  numerical  results  to  the  experimental 
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results,  it  is  necessary  to  take  into  account  the  difference  in 
resolution:  the  experimental  results  are  given  as  nine  data  points 
while  the  calculated  distribution  contains  100  data  points.  Thus, 
the  experimental  results  represent  the  local  average  Nusselt  Number 
distribution  which  is  much  smoother  than  the  detailed  pointwise 
distribution  given  by  the  numerical  calculation. 

In  spite  of  this  difference  in  resolution,  both  the  numerical  and 
experimental  results  show  similar  trends.  Both  exhibit  high  heat 
transfer  near  the  upstream  corner  of  the  rib  with  minimum  heat 
transfer  just  behind  the  downstream  edge  of  the  turbulator.  Both 
distributions  show  the  heat  transfer  first  increasing  and  then 
decreasing  as  X/D  increases  downstream  of  the  turbulator. 

The  average  values  of  these  distributions  were  determined  by 
integrating  along  the  axis  of  the  rectangular  channel.  The  average 
of  the  calculated  Nusselt  Number  was  found  to  be  1.5123  while  the 
average  of  the  experimental  Nusselt  Number  was  found  to  be  2.3028. 
Thus  the  average  calculated  Nusselt  Number  is  approximately  34\ 
lower  than  the  average  experimental  Nusselt  Number. 

This  difference  can  be  attributed  to  several  factors.  Again,  the 
rib  shape  has  a  profound  effect  on  the  flow  field  and  hence  the 
heat  transfer.  The  experimental  data  includes  3-d  effects  which 
are  not  present  in  the  numerical  model.  The  Baldwin-Lomax 
turbulence  model  may  not  be  adequate  to  predict  the  turbulence 
effects  in  the  recirculation  zone.  Another  important  factor  may  be 
the  difference  in  free  stream  velocity.  Han  et  al  report  an 
average  free  stream  velocity  of  approximately  70.0  ft/sec  while  the 
corresponding  value  used  in  the  numerical  prediction  was  270.0 
ft/sec.  The  Reynolds  number  based  on  hydraulic  diameter  in  both 
cases  was  60,000.  However,  since  the  numerical  model  is  2-d,  the 
channel  width  in  the  simulation  is  arbitrary  and  was  chosen  to  be 
four  times  the  channel  height  since  that  was  the  highest  aspect 
ratio  reported  by  Han  et  al.  Finally,  the  experimental  test  set  up 
included  several  ribs  along  the  duct  while  the  numerical  model 
contained  only  a  single  rib.  Han  et  al  [2]  report  that  the  heat 
transfer  results  do  not  settle  into  a  periodic  pattern  until  the 

flow  has  gone  over  the  first  three  ribs. 
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Several  of  these  possible  causes  of  error  will  be  eliminated  in 
the  four  turbulator  model.  Obviously  the  number  of  ribs  is  greater 
than  the  three  required  to  establish  the  periodic  results  reported 
from  the  experiment.  The  free  stream  velocity  will  be  set  equal  to 
that  reported  by  Han  et  al.  Also  the  turbulator  size,  geometry, 
pitch  and  angle  of  attack  will  duplicate  the  experimental 
conditions.  This  work  will  by  continued  at  the  University  of 
Missouri  with  funding  from  the  Air  Force  Office  of  Scientific 
Research. 


G.  Conclusions 

In  conclusion,  the  flow  field  and  heat  transfer  along  a  turbine 
blade  internal  cooling  passage  was  numerically  simulated  using  a 
two  dimensional  geometry.  Attention  was  focused  on  the  flow  along 
a  short  straight  section  of  a  rectangular  passage  with  a  single 
transverse  mounted  turbulator  protruding  from  its  floor. 

In  contrast  to  the  work  done  by  earlier  investigators,  this  study 
was  performed  using  a  computational  mesh  which  had  very  high 
resolution.  There  were  ICO  grid  lines  in  the  horizontal  (axial) 
direction  clustered  around  the  turbulator  and  60  grid  lines  in  the 
vertical  direction  with  44  within  the  first  rib  height  which  was 
well  within  the  turbulent  boundary  layer.  This  high  resolution 
mesh  revealed  considerable  detail  of  the  flow  field  and  heat 
transfer  near  the  turbulator. 

The  calculation  revealed  a  flow  structure  near  the  turbulator 
which  is  more  complex  than  is  ordinarily  expected  for  such  a  simple 
passage/turbulator  geometry  and  has  a  pronounced  effect  upon  the 
heat  transfer  as  shown  in  Figures  17  and  18.  This  significant 
insight  indicates  that  further  study  is  needed  into  the  effects  of 
turbulator  geometry  and  spacing  (or  pitch)  as  a  means  of  heat 
transfer  enhancement.  Furthermore,  this  calculation  revealed  a 
considerable  difference  between  the  heat  transfer  rate  calculated 
by  using  the  Reynolds  Analogy  and  that  given  by  a  direct 
calculation  using  Fourier's  law.  This  discrepancy  results  from  the 
complex  flow  structure  near  the  turbulator  which  causes  the  normal 

gradient  of  velocity  to  differ  greatly  from  the  normal  gradient  of 
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temperature.  Similar  flow  structures  exist  throughout  many 
turbomachinery  components,  and  therefore,  corresponding  heat 
transfer  effects  may  occur  in  those  components  which  would  also  be 
inadequately  predicted  by  the  Reynolds  Analogy.  This  significant 
insight  may  lead  to  remedies  for  the  underprediction  of  heat 
transfer  rates  in  many  turbomachinery  calculations. 

Finally,  the  numerical  results  were  found  to  compare  favorably 
with  reported  experimental  results.  Both  sets  of  results  display 
similar  trends.  Comparison  of  the  axially  averaged  Nusselt  Number 
shows  34%  error  which  is  reasonable  considering  the  difference  in 
rib  geometry,  number  of  ribs,  and  free  stream  velocity  as  well  as 
the  absence  of  3-d  effects  in  the  numerical  model. 
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As  outlined  in  Section  II  of  this  report,  work  has  begun  on  a 
four  turbulator  model  and  as  discussed  in  the  conclusions  of 
Section  III,  this  model  will  remove  many  of  the  causes  of 
discrepancy  between  the  numerical  results  and  the  experimental 
results.  It  is  recommended  that  this  modeling  effort  be  completed 
and  the  algorithm  validated  by  comparison  with  experimental 
results.  After  this  has  been  done  several  parametric  studies  can 
be  performed. 

First,  a  grid  resolution  study  will  be  done  by  producing  a 
converged  solution  using  a  course  mesh  formed  by  omitting  every 
other  grid  line  shown  in  Figures  2  and  3.  This  converged  solution 
will  then  be  interpolated  to  the  fine  mesh  of  Figures  2  and  3. 
Iteration  will  then  be  restarted.  Thus,  the  grid  dependence  of  the 
solution  can  be  determined.  Also,  this  should  produce  an  efficient 
solution  strategy  since  part  of  the  iteration  will  be  done  using 
only  a  quarter  of  the  total  grid  points. 

Second,  using  the  refined  mesh,  parametric  studies  of  the 
effects  of  Reynolds  Number  and  rib  pitch  can  be  done  and  compared 
to  the  experimental  results. 

Third,  the  ribbed  surface  could  be  reflected  about  the  duct  line 
of  symmetry  and  the  effects  of  top  and  bottom  rib  alignment 
studied. 

Fourth,  various  turbulence  models  could  be  incorporated  in  the 
algorithm  and  these  effects  upon  flow  field  structure  and  heat 
transfer  ascertained. 

Finally,  this  work  should  eventually  progress  towards  a  full  3-d 
Navier  Stokes  solution  which  could  then  include  effects  of  rib 
angle  of  attack  and  duct  rotation. 
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ABSTRACT 


This  report  describes  work  on  an  algorithm  constructed  in 
connection  with  the  estimation  of  frequencies  in  radar  signals. 

In  the  event  multiple  frequencies  are  present  in  a  signal,  the 
Prony  method  is  applied  using  sampled  data  with  a  time  lag  large 
enough  so  that  direct  calculation  of  the  frequencies  is  possible 
only  up  to  modulus  the  principal  value.  The  algorithm  discussed 
here  is  an  attempt  to  resolve  this  problem.  Since  the  problem  is 
shown  to  be  inherently  unstable,  several  adaptations  are  proposed 
to  overcome  some  of  the  sources  of  error. 
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1 .  Introduction. 

The  problem  of  estimating  frequency  of  radar  pulses  has  received 
considerable  attention  (see,  e.g.,  (51  and  the  bibliography).  Methods  of 
collecting  the  data  from  a  signal  vary,  and  in  all  cases  numerical 
software  must  be  constructed  to  estimate  the  associated  frequencies. 

Once  the  data  has  been  collected,  some  numerical  scheme  or  schemes 
are  employed  to  derive  estimates  for  parameters  related  to  the  signal 
frequency,  phase  or  amplitude.  The  most  popular  tools  are  related  to 
transform  techniques,  such  as  the  Fourier  Transform  or  the  Walsh 
Transform.  For  a  high  level  accuracy,  however,  a  large  number  of  sample 
points  is  necessary.  Another  numerical  method  which  can  be  adapted  for 
this  problem  is  the  Prony  method  13).  In  this  report,  the  Prony  method 
and  its  relation  to  the  problem  at  hand  will  be  discussed.  In  [1]  an 
algorithm  for  implementation  of  the  Prony  method  was  proposed  and  this 
report  presents  further  study  of  the  problem  and  adaptations  in  the 
algorithm.  In  particular  the  problem  of  computing  estimates  of  several 
frequencies  occuring  in  a  signal  will  be  examined,  and  the  stability  of  the 
method  will  be  studied. 

Section  2  outlines  the  Prony  method  for  approximating  the  frequencies 
in  a  signal.  In  section  3  a  discussion  of  the  unstable  nature  of  this 
algorithm  is  presented  by  deriving  a  factorization  of  the  system.  Section 

4  contains  a  discussion  of  the  adaptations  to  these  instabilities  and  the 
algorithm  for  pairing  is  discussed.  Some  examples  are  presented  in  section 

5  and  a  scheme  for  the  reduction  of  the  sensitivity  to  error  is  proposed. 
Concluding  remarks  follow  in  section  6. 
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2.  nte  SENSB&L  METHOD 


In  this  section  we  examine  the  process  after  the  data  has  been 
collected.  Thus,  assume  we  have  a  signal  S(t)  which  is  of  the  form 


(1) 


S(t) 


P 

Z 

k-1 


p^^2ni(o^t 


where  Pj^,  and  '^3^  are  the  amplitudes,  frequencies  and  phase  angles. 

respectively.  The  digitized  data  is  collected  via  an  IFM  receiver  by 
passing  the  signal  pulse  through  a  time  delay  of  known  length  t.  This  is 
essentially  equivalent  to  viewing  the  signal  as  being  sampled  at  times  0,t. 
2t. . . . ,Nt.  Thus,  we  may  use  (1)  to  write 


(2) 


S  (nr) 


P 

Z 

k-1 


Pi,e 


2ni(uj^nT  + 


P 

Z 

k»l 


2nicj3^nT 


2rTi<P,, 

n  -  0,1 . Nt.  where  qj^  -  Pj^e  k  =  0,1 . p.  Since  we  are 

interested  in  calculating  the  unknown  values  k  -  1,2 . p,  we  then 

use  these  sampled  values  to  construct  a  polynomial  P(z)  which  has  roots 
2n iw„T 

Zy.“e  '^,r  =  l,2,...,p.  Hie  following  method,  called  the  Prony 
method  (see  [3])  is  a  scheme  to  obtain  these  values. 

If  P(z)  =  zP  +  ajzP"^  +  . . .  +  ap  is  the  polynomial  we  seek,  it 
follows  that  since 


57-3 


®l2^^k+2p-2^  ^  ^22(^k+2p-3^  +  ...  +  ^pS(t)^+p_2)  =  -S(tj^+2p) 

If  we  solve  (3)  for  the  coefficients  a^,  i  -  1.2 . p,  we  have  the 

polynomial  P(z)  and  we  can  find  the  roots  of  P(z)  using  a  method 
such  as  Newton's  method.  In  general,  the  system  (3)  will  be  low  order 
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(p  <  10)  and  a  direct  method  such  as  Gaussian  Elimination  will  be 
suitable  for  solving  (3) .  In  practice,  the  system  (3)  corresponds  to  the 
autoregressive  model,  where  the  matrix  coefficients  S(tj)  are  the  auto¬ 
correlation  coefficients.  Thus  with  a  matrix  of  this  type  one  could  use  a 
factoring  algorithm  designed  for  symmetric,  positive  definite  matrices, 
such  as  a  Cholesky  Decomposition.  In  all  the  examples,  a  Gaussian 
Elimination  code  for  complex  matrices  was  used. 

The  polynomial  P(z)  now  has  complex  coefficients,  and  for  p  >  2  an 
iterative  method  (Newton's  method)  of  order  2  is  used.  Generally,  these 
methods  are  sensitive  to  starting  data,  and  for  complex  polynomials  the 
regions  of  convergence  can  be  quite  complicated. 


3.  SIA£ILm  QZ  IHE  METHOD 


In  order  to  investigate  the  stability  of  such  an  algorithm,  we  use 


the  matrix  form  of  (3)  and  write 

(4)  Ax  -  b 

where 

S(tj,) 

S(tj,^.i) 

^^^k+p-1^  .  , 


S(^k+p-l^  S(tj^+p_2) 

^^^k+p^  ^^^k+p+1^ 


2(^k+2p-2^  S(^k+2p-3^ 
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X  ■  »  and  b  “  ( S  ( )  ,  S  ( ),...,  S  ( ^)^+2p— 1  ^  ^  ■ 

the  superscript  T  denotes  the  transpose  operation.  The  elements  of 
are  given  by  (2),  which  we  may  rewrite  as  S(kT)  -  Q^Ej^,  where 

=  (qi.q2 . ‘Jp) 

and 


„  2TTi<o  Tk  2nicj.Tk  2niOrx'^k  T 

Ej^=(e  ‘  .e  ^  . e  P  ). 


The  structure  of  the  matrix  allows  us  to  write  the  ith  row  as 


(S(tk+p+i_i)  '^^^k+p+i— 2^  .  ■  .  •  >S(tj^^^)  ) 


and  the  jth  column  as 


^^^k+p-j^ '^^^k+p-j+1^  *  * • • '^^^k+2p-j-l^  ^  ' 


hence  the  element  a^j  =»  S(tj^+p+ ,  and  we  may  express  this  as 


4j  “  Q^^k+p+i-j-1 


The  matrix  A  may  be  written  in  factored  form  as 


Here 

A 
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2rTiu  T(k+p) 

2nio.T(k+p) 

2nio„T(k+p) 

Qie 

^2®  •  ■ 

■  •  qpe  ^ 

2rrio,  T(k+p+l ) 

2nio,T(k+p+l) 

2JTi(j_T(k+p+l) 

qie  ‘ 

q2®  •  ■ 

•  qp®  P 

2ni(j.  T(k+2p-l) 

L  ^1® 


2nicj.T(k+2p-l) 

q2® 


2JTicj^T(k+2p-l) 
qpS  ^ 


X 


-2TTi(J,T 
e  * 

-2TTi(J,T 
e  ^ 


-2nio  2t 
e  ^ 

-2nicj.  2t 
e  ^ 


-2niu^pT 
-2nio. pT 

e 


-2rric.)_T  -2nio_2T 
e  e 


e 


-2jt  iWppT 


Note  also  that  the  first  matrix  may  be  factored  as 


1 

1 

1 

2niu,  T 

27TiCJ,T 

2nio>,T 

e  ‘ 

e 

e  P 

2nicj,T(p-l) 
e  ^ 

2TTiu*T(p-l) 
e  ^ 

2TTiO  T(p-l) 
.  .  .  6  ^ 

2ni<j,T(k+p) 

Qie^  0 

27Tio,T(k+p) 

0  q2e 


0 

0 


L  0 


0 


Qpe 


2n  iOp 


(k+p) 
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We  observe  that  the  right  hand  side  of  (4)  also  has  such  a  factorization, 
and  it  follows  that  we  may  rewrite  (4)  as 


(6) 


W  Q  W  X  -  -  W  Q 


where  W  is  the  left  factor  in  (5).  Q  -  diag(q^,q2 . qp) ,  E^+p 

^  2nio  T(k+p)  2nicj  T(k+p)  2nio_T(k+p)  * 

diag(e  ‘  .e  ^  ^ . e  P  ^  .  W  is  the 


2nicj,T  2nicj,T 


2ni<,j„T 


conjugate  transpose  of  W  ,  and  v  =  (e  ,  ...  ,e“  "  P  )^. 

This  decomposition  makes  the  analysis  relatively  plain,  since  the 
existence  of  a  unique  solution  depends  on  the  nonsingularity  of  the  matrix 

W  and  the  nonvanishing  of  the  components  of  Q. 


REMARK  1 .  The  matrix  W  is  in  the  form  of  the  Vandermonde  matrix,  so  it 


is  obvious  that 

does  not  vanish 


2n  i  Q  T 

W  is  nonsingular  if  the  product  IT  (e 

kT^j 


(see  [3] ) . 


2nic.vT 
e  ^  ) 


2ni''fT, 

REMARK  2 .  Since  qj^  =  pj^e  k  =  l,2,...,p,  the  matrix  Q  is 

nonsingular  as  long  as  the  signal  S  consists  of  the  frequencies  k 


-  1 , 2, . . . ,p. 

As  pointed  out  in  (1]  and  [6J ,  if  t  is  chosen  sufficiently  small 
so  that  for  the  frequencies  w  which  are  in  an  interval  (o<,P),  we  have 

2  n  i  tj  •  T 

(P-o<;)t  <  1,  then  the  above  method  is  stable  since  the  values  e 

will  all  be  distinct.  However  this  choice  of  r  leads  to  inaccuracies  in 

data  collection,  and  as  a  result  it  is  necessary  to  choose  a  larger  value 
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for  T.  In  light  of  this,  we  now  have  two  problems:  namely  the 
instability  of  the  above  method  and  the  determination  of  the  correct 
frequency  from  ambiguous  information.  Note  that  the  method  is  now 
inherently  unstable  because  for  a  given  frequency  u  such  that  u+a  s 

1  <j 

(a.b),  if  T  >  ; -  ,  then  to  >  - —  and  it  follows  that  it  is  possible  to 

b-a  b-a 

find  values  and  02  for  which  to^^  >  1  and  toj^  =  j  +  T02 ,  hence 

2nio.T  2nio,T 

e  ‘  =  e  ■"  .  In  this  case,  the  product  mentioned  in  Remark  1 

vanishes  and  the  matrix  W  is  singular. 

The  problem  may  be  complicated  further  by  the  assumption  that  there 

exist  p  frequencies  while  p  frequencies  are  present  and  p  <  p.  This 
is  the  point  made  by  Remark  2.  Although  this  leads  to  an  instability,  one 
may  evade  this  problem  by  a  proper  strategy  in  the  implementation  of  the 
algorithm.  In  this  case,  the  Gaussian  Elimination  routine  in  which  the 
coefficient  matrix  is  tr iangularized  has  a  built-in  check  to  determine 

whether  the  rank  p  is  less  than  p.  If  so,  the  autocorrelation  matrix 

is  recalculated  assuming  p  frequencies  are  present. 

In  addition  to  the  above  mentioned  problems  associated  with  this 
scheme,  it  is  clear  that  the  addition  of  signal  noise  and  computation 
error  may  lead  to  large  errors.  In  general,  since  the  coefficient  matrix 
consists  of  the  autocorrelation  coefficients,  one  expects  the  noise  to 
perturb  the  diagonal  of  the  matrix  (see  [5]). 
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4. 


Once  a  value  of  t  has  been  selected,  the  algorithm  is  clear  until 
we  must  decide  the  true  values  of  the  frequencies  , ^2  «  •  •  •  « •  Thus, 

with  the  scheme  we  have  up  to  this  point,  we  may  obtain  a  solution  to  the 
system  (4)  as  long  as  a  unique  solution  exists,  and  then  employ  a  root¬ 
finding  technique  to  get  values  k=l,2,...,p,  which  are  the  zeros  of 


the  polynomial  P(z).  These  values  satisfy  =  e  ^  • 

REMARK  3 .  It  is  worth  mentioning  at  this  point  that  root-finding 


techniques  for  complex  polynomials  are  inherently  unstable  and  extremely 
sensitive  to  starting  points-.  In  (Sis  itSF&tion  SOtiSIii^S  lOl'-  Fi'lCtt'lOtlS  Of 
this  nature  are  pointed  out  to  be  related  to  chaos,  which  arises  from  the 
complicated  regions  of  convergence  of  this  method  in  the  complex  plane. 

2n  icJvT 

Now  that  the  values  of  e  ^  have  been  obtained,  we  only  know  the 
principal  value  of  That  is,  we  know  a  value  where  '^k  '''  ^k  ^ 

for  some  integer  Mj^.  This  problem  of  ambiguity  has  been  the  subject 

of  research  by  the  author  (see[ll)  and  others  (see  [6,7]).  Clearly,  an 
error  made  in  the  value  of  can  lead  to  rather  disastrous  results, 

since  the  calculated  value  for  wi  1 1  be  off  by  at  least  1/t,  where  r 


is 


relatively  small  (0.015,  for  some  examples  given  below). 

Errors  arising  from  computation  or  noise  can  degrade  the  values  of 

and  since  the  bandwidth  is  quite  large,  we  are  still  working  with 


quite  small  values  of  t  ,  hence  these  errors  tend  to  be  magnified  when  we 
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solve  for  cjj^.  Thus,  to  explain  the  algorithm,  we  first  assume  we  are 

working  with  the  true  values  of  0  and  follow  this  by  altering  the 
algorithm  in  an  effort  to  compensate  for  the  errors. 

Suppose  the  width  of  the  interval  containing  the  frequencies  is  a 
given  integer  B  (band  width)  and  assume  that  cj  e  (0,B)  .  Note  that  for 
the  practical  case,  a  s  (^,A+B),  where  A  -  2Ghz  and  B  =  2Ghz .  Let  . 

K2,  and  K3  be  given  integers  so  that 
(6)  +  K2  -  K3  -  0 

and  ,  K2 ,  K3  are  relatively  prime.  Further  assume  that  B  is  a  prime 

integer.  These  assumptions  will  be  helpful  in  damping  errors. 

If  we  define  =  K^/B.  T2  -  K2/B  and  T3  -  K3/B,  we  apply  the 

above  scheme  to  obtain  three  sets:  Tj^ ,  T2.  and  T3,  where,  if  9  £  Tj^ .  then 
for  some  M,  and  some  q  9  +  M  -  if  'f  e  T2  then  there  is  an  integer 

N  and  some  o  such  that  +  N  =  T2tj,  and  if  y  £  T3  then  there  is  an 
integer  P  and  an  cj  such  that  y  +  P  =  'r3cj. 

Define  the  set  Q  -  {c;>2^,cj2'  •••  to  be  the  set  of  frequencies 

actually  represented  in  the  signal.  Now  if  cj  £  Q.  then  there  exist 
values  9  £  Tj^ ,  '■f  £  T2  and  -v  £  T3  and  integers  M,  N,  and  P  such  that 

9+M  =  Tj^u,  'f  +  N  =  •T2tJ  y  +  P  =  T3f,j . 

From  the  property  (6),  it  follows  that 
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(9  +  M)  +  ('f  +  N) 


(y  +  P) 


0 


hence 


Now  &.  and  all  lie  in  the  interval  (0,1).  so  the  inequality 

-1  <  0  +  'f  -  <  2 

must  be  satisfied,  and  since  P  -  N  -  M  is  an  integer,  the  only  candi¬ 
dates  for  this  combination  are  0  and  1.  This  forms  a  necessary 
condition  for  proper  association  of  the  triple  (9.’^,+),  hence  we  have  the 
following  definition: 

Definition.  We  say  that  the  triple  (9,'f,y)  is  a  properly  associated 
candidate  if  the  condition 


I  0 

(7)  9  +  =  I  ^ 

holds . 

We  use  the  term  'candidate'  here  because  it  is  possible  to  satisfy 
(7)  and  yet  be  an  improperly  associated  triple.  Thus,  the  algorithm  for 
proper  association  is  aimed  at  collecting  all  triples  satisfying  (7)  and 
then  eliminating  the  improper  associations,  as  illustrated  in  the  examples 
below. 

Now  suppose  we  have  a  properly  associated  triple.  Then  since 


0+M 


'■f+N 


we  have  the  relation 

(8)  K2©  -  K^'-f  -  K^N  -  K2M. 

We  are  looking  for  a  solution  pair  (N.M)  where  0  S  M  <  and 

0  <  N  <  K2  (note  that  we  may  have  used  a  combination  with  f  and  P,  where 
0  i  P  <  K3)  .  Since  and  K2  are  relatively  prime,  there  exists  a 

unique  pair  (N.M)  for  which  (8)  is  satisfied  (see  .  e.g.  [8]). 

It  should  be  pointed  out  here  that  the  above  scheme  is  rather 
delicate.  Consider  the  following  example. 

Examp I e  1 .  Suppose  cj-j  =  co^  +  x  .  where  e<  =  B/K^  ,  Then  while  0j  +  Mj  = 
we  find  that  ^k  +  1  “  ^ j  +  Mj  +  1 .  So  if  (7)  is 

satisfied  by  the  triple  (0  j , .  it  will  also  be  satisfied  by  (0}^.'f.-V). 

Note,  however,  that  for  these  values  of  u  the  matrix  W  in  (6) 
would  be  singular,  and  some  adaptation  must  be  made.  To  account  for  this 
singularity  we  make  the  observation  that  the  contribution  of  qj  and 
in  the  values  of  S(nT)  given  by  (2)  result  in 

2 n  i cj .  n T  2n  i c;. nr  2tt  i nr  2tt  i u_  nr 

S(nT)  =  q^e  ‘  +  .  .  .  +  qje  J  +  .  .  .  +  qj^e  ^  +  .  .  .  +  qpS  P 

and  since 
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2nio,nT 


qk© 


2rTicjknT 


(q 


Qk)© 


2niOjnT 


then  the  rank  of  the  original  system  corresponding  to  t  is  reduced. 

Thus,  by  solving  a  system  (4)  which  has  been  reduced  to  a  system  which  has 
rank  <  p,  we  generate  a  set  with  p^  distinct  values. 

So  the  first  adaptation  we  make  is  to  calculate  the  rank  of  the 
system  (4)  for  each  and  find  the  three  sets  ,  T2  and  T3  with  p^ . 

P2  and  P3  distinct  elements,  respectively. 

In  the  next  example,  we  see  how  it  is  possible  for  a  false 
association  in  another  way.  We  first  note  that  for  each  ©  s  there 

exist  values  9  and  9  in  T2  and  T3  respectively,  for  which  (7)  is 
satisfied . 


Example  2.  If  (B.9.-V)  is  a  properly  associated  triple  corresponding  to 
cj,  suppose  that  03  ~  where  T2c<  +  T3P  is  an 

integer.  Then  (7)  will  be  satisfied  by  since 


T CO  4*  ^2^’^2  ~  ~  r^^co  +  T2C0  —  T3CJ  +  3 

from  which  it  follows  that  ®  +  '^2  “  ’*'3  "  0  I  •  Note  also 

happen  even  if  the  system  (4)  has  a  unique  solution  for  , 


that  this  may 
T2  and  T3 . 


In  order  to  adapt  to  this  problem,  we  first  consider  the  case  in 
which  all  three  sets  Tj^ ,  T2 ,  and  T3  have  p  elements.  This  means  that 

for  each  8  s  T^^ .  there  exists  exactly  one  9  and  one  y  for  which  the 
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triple  is  properly  associated,  similarly,  for  'f  s  T2  there 

exists  exactly  one  element  d  e  and  one  element  >1'  e  T3  for  which  the 

triple  O.'f.y)  is  properly  associated.  The  same  statement  may  be  made 
for  y  €  T3 .  If  either  9  or  appears  in  no  other  triple,  then 

(e.'f.'V')  is  correctly  associated.  In  this  manner  we  may  collect  those 
triples  for  which  (7)  is  satisfied  and  for  which  any  one  of  8.  .  or  y 
appear  only  once. 

Before  proceeding,  the  algorithm  should  be  illustrated  for  the  case 
p  =  2.  In  this  case,  we  generate  the  three  sets 


{'^1  .  '^o> 


•♦'o) 


and  the  matrices  M(8j)  are  formed  in  which  a  1  is  placed  in  the  j.kth 
position  if  i  * '^j « satisfies  condition  (7).  Note  that  if  each  set 

consists  of  2  elements  we  then  have  two  2»2  matrices  indicating  the 
candidates  for  proper  association.  We  must  determine  the  candidates  which 
are  correctly  associated.  We  have  two  cases  which  can  occur: 

Case  1:  M(8j^)  = 


and  M(82) 


[0  n. 


Case  2:  M(8j^) 


=  M(82) 


It  is  easily  seen  that  there  are  no  other  combinations  possible.  For 


example,  if  M(0j^)  ”  [  q  o  J  '  both  (0^  . '-f )  and  - '■^1 . +'2 ^ 

would  satisfy  (7),  and  from  that  relation,  it  follows  that  -Vi  -  ■V2'  which 

is  not  the  case.  Note  that  in  Case  2  it  may  appear  that  there  are  4 
possible  associations,  but  when  equation  (8)  is  solved  using  K2  and  K3.  it 

follows  that  there  are  only  two  values  of  o  calculated. 

If  one  of  the  sets  only  has  one  element,  say  Tj  =  then  we  have 


r  1  0  1 

the  matrix  “  1  q  1  both  (8^^ , )  and  ('^1  *'■^2  ’ '*'2  ^ 

correctly  associated  triples.  It  is  impossible  for  either  of  the  other 
elements  to  be  nonzero,  for  if  this  were  the  case,  then  we  have,  say 


M(0i) 


1 
L  0 


1 
1  J 


,  which  implies  > '^1  ’ '<^1  ^  * '^2  ^  both 


satisfy  equation  (7).  But  this  means  that  f 2  -  which  is  not  the 


case.  Note  that  here  there  may  only  be  one  set  with  one  element,  and  the 
other  two  sets  must  have  2  elements.  This  is  because  if.  in  addition. 


T3  -  { }  ,  since  ^2  =•  +  c<.  where  t^^oc  s  r/  (the  integers),  then  we  also 

must  have  T3oc.  s  3^.  Hence  «:  =  l/'^i  =  J/'''3,  where  I.  J  £  Since  «  < 

B,  we  must  have  0  <  I  <  and  0  <  J  <  K3  satisfying  Kj^J  =  K3I.  where 

and  K3  are  relatively  prime.  But  since  this  cannot  happen,  then  T3 

must  have  two  distinct  elements. 

In  theory,  it  has  been  shown  that  this  scheme  works  for  all 
situations  of  order  2.  Experiments  show  however,  that  the  method  works 
best  if  there  are  no  errors,  either  due  to  roundoff  or  to  noise.  It  is 
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the  purpose  of  the  next  section  to  show  how  this  may  be  improved  to  avoid 
the  catastrophic  errors. 


5.  ERROR  COMPENSATION. 

The  problem  of  non  exactness  in  the  algorithm  has  been  the  subject  of 
much  study  both  by  the  author  and  by  McCormick  (see  [11, (6, 7])  and  the 
source  of  many  huge  errors,  hence  illustrating  the  non-Lipshcitzean  nature 
of  this  problem.  It  was  McCormick/Tsui  [7]  who  finally  saw  a  means  to 

reduce  the  sensitivity  of  the  algorithm  to  noise.  If  we  take  the  data 
obtained  by  the  Prony  method  and  form  the  sets  Tj^,  T2  and  T3,  then 

quantize  these  values,  by  assuming  P  bits  of  accuracy,  we  can  replace 
the  calculated  values  by  quantized  values  and  apply  the  algorithm  to  the 
new  sets  of  quantized  estimates  to  the  principal  values.  Thus,  denote  by 
0^,  and  the  true,  calculated  and  quantized  values.  This  means  that 

nintC^c^^) 

so 


lei 


q 


and  1 ~  ^  ' 


where  s  is  a  bound  on  the  error  due  to  roundoff  and  noise.  In  the 
algorithm,  the  first  thing  necessary  is  to  see  if  an  ordered  triple 
satisfies  (7)  .  If  =  S ,  where  S  =  0  or  1  from  equation  (7)  , 
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then 

>  ~  ■’♦'a  ~  ^  I  <  3  f-i-  +  £  1  • 

q  q  <3  u  2^+1  ^ 

Using  this  bound  as  a  guide,  although  a  bit  pessimistic,  from  practice 
it  has  been  found  that  a  criterion  for  acceptance,  i.e.,  that  (9^,9^. 

satisfy  (7)  is  if  19  +  -  SI  <  — ^ 

M  Mm  — 1 


Once  we  have  determined  that  such  a  triple  is  properly  associated, 
the  question  is,  how  severe  an  error  is  possible.  From  (8)  we  have 

K2®q  -  =  K2(0q  -  ©t  +  ^t^  "  C^q  -  '^t  +  '^t ) 

=  KiN  -  K2M  +  K2«?q  -  -  KiC^q  -  '^t  > 

and  since 


K^  +  Ko 

IKjCSq  -  9t)  -  -  -ft)  I  <  -^-777^ 


Ki  +  K:>  1 

then  we  should  require  that  ^  <  —  .  This  being  the  case,  we  may 

2 

replace  the  left  side  of  (8)  by  nint(K28q  -  K^'^q)  ,  and  solve  the 

resulting  equations  for  the  appropriate  integer  values  of  M  and  N. 
Furthermore,  since  the  values  K^,  K2  and  K3  must  be  ordered  in  some 

way,  say  <  K2  <  K3,  a  method  to  control  the  error  would  be  to  use  the 

equation  (8)  with  K3 .  There  are  two  possibl ities ,  namely: 
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(10a)  nint(K30q  -  K^'f'q)  =■  K^P  -  K3M 

or 

(10b)  nint(K3'fq  -  K2'('q)  *  K2P  -  K3N 

The  reason  for  this  is  that  once  a  pair  (P.M)  has  been  determined,  the 
solution  cj  is  of  the  form 


Cl, 


-  C-Vq  +  P) 


so  that 


1  CJ^  -  CJ  I 


£ 


]■ 


The  requirement  that  Icj^  -  cj|  <  1  imposes  the  final  restriction  on  the 
number  of  bits  P.  the  size  of  K3  and  the  accuracy  of  the  calculation  of 


1 

.  for  example,  if  £  <  - 

q  2^+1 


K3  >  —  .  As  an  illustration  with 
2^ 


then  the  requirement  on  K3  is  that 
B  2000,  if  P  =  7  then  K3  >  16. 


In  order  to  discuss  the  error  e  we  first  recall  that  in  equation 
(6)  the  existence  of  a  unique  solution  x  depends  on  the  nonvanishing  of 
the  product  given  in  Remark  1.  Suppose  that  the  values  are  all 

distinct,  j  -  l,2,...,p.  Then  to  simplify,  the  solution  of  (6)  is  also  a 
solution  of 
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(11)  W*x  -  -V. 

Errors  from  noise  and  roundoff  will  also  occur  in  the  product  W  Q  Ej^+p_i. 
however  as  a  point  of  illustration,  it  will  suffice  to  look  at  (11). 

Since  W  is  the  conjugate  transpose  of  W  .  the  existence  of  a  unique 
solution  of  (11)  depends  on  the  non-vanishing  of  the  determinant,  which  is 
given  by 


det(W*) 


_  -2niuh.r  -2T!iUiT 
n  .  (e  -  e  J  )  . 


It  is  evident  that  as  this  value  gets  near  zero,  the  sensitivity  to 
perturbations  in  the  data  increases.  It  is  also  clear,  that  since  the 
error  in  computation  of  the  solution  to  (6)  under  perturbations  of  the 
matrix  A,  say  <^A  satisfies  the  inequality 


(12) 


I 


IXc  -  x^l 


1  - 
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II  II^AII 


[ll^AII  IIA“^II  llvll] 


(see  [9]),  it  is  possible  that  the  denominator  in  (12)  cannot  be 
guaranteed  to  be  positive,  hence  it  is  possible  for  the  computed  values  of 
the  coefficients  of  P(2)  to  contain  enough  error  so  that  the  roots 

obtained  cannot  be  trusted.  Here,  the  norm  II  •  It  is  taken  to  be  ll'll^c,. 
where 
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p  p 

11x11^5  -  Z  IXjl.  and  HAII^q  =  max  Z  la^jl. 
i“l  1  j=l 


As  a  final  comment,  the  error  in  x^,  will  degrade  the  accuracy  in 

the  computed  values  of  the  roots  of  P(z) .  As  an  estimate  via  first-order 
expansion,  take  the  case  of  p  -  2  and  let  the  error  in  a^  be  given  by  Sj, 

i  -  1.2.  Then  the  error  in  the  roots  can  be  estimated  by 


2Sj(e 


2nicjjT 


2n  iu^jT 

e  2  ) 


iL- 


4S- 


2n  icjj^T 


2n  iu-jT 

e  ^ 


Note  again,  the  error  is  related  to  the  closeness  of  the  values  of 
and 

Example  3.  Suppose  we  have  p  =  3,  =•  13.  K2  =  21,  K3  -  34,  P  =  7 

and  B  =  2143.  If  we  know  that  the  signal  contains  the  frequencies  3000, 
3170,  and  3042,  all  in  the  range  ( 2000 , 2000+B) ,  then  through  scaling  to 
the  interval  [0,B],  we  are  searching  for  the  frequencies  1000,  1170  and 
1042.  With  these  values  known,  we  check  the  algorithm  to  see  if  it  is 
possible  to  retrieve  these  values,  and  follow  through  the  steps.  Once  the 
roots  of  the  polynomials  have  been  found  and  the  arguments  quantized,  we 
have  the  following  table; 
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Tl 


T 

c 

/B 

2^q 

K2/B 

c  2^q 

K3/B 

c  2’^q 

e 

.066 

8 

.098 

12 

.321 

41 

9 

.790 

102 

.465 

60 

.  211 

27 

9 

.866 

111 

.563 

72 

.532 

68 

Table  1. 

8  12  41  1 

60 

102  1 

1  68 

72 

111 

8'128'128  ' 

T2  - 

1 

•  128' 

128' 

128  ' 

T3  - 

1  128' 

128 

The  indicator  matrices  are: 


■  1  0  0  1 

■  0 

0  0  ■ 

■  1 

0  O' 

MCS^)  = 

0  0  0 

.  0  0  1  . 

M(e2)  = 

0 

.  0 

1  0 

0  0  . 

M(e3)  = 

i 

0 

.  0 

0  0 

0  0. 

The  triples  satisfying  (7)  are: 


.•■f’3  .^3)  ,  '  '^'2^  ’  ^*^3''^!  '  ■*'1  ^ 


Note  that  we  eliminate  )  since  all  three  entrys  occur  in  other 

triples  and  the  T^  all  contain  p  distinct  elements. 


Using  equation  (10b),  we  have 


nint(K3'^3  ~  ^2^3^ 
nint(K3'f2  “ 
nint(K3't'2^  -  K2+1) 


21P 

-  34N; 

P  =  15 

and 

N  =  9 

21P 

-  34N: 

P  =  18 

and 

N  =  11 

-4  =  21P  -  34  N;  P  =  16  and  N  =  10 


With  these  values  we  solve  for  cj  : 


57-22 


2143 

111 

CJi  =  - 

^  34 

'  128 

2143 

72 

02 

34 

^128 

2143 

68 

03 

34 

^128 

+  15) 


+  18) 


+  16) 


1000.09 


1169.98 


1041.95 


6.  CONCLUSIONS 

From  the  above  discussions,  it  is  evident  that  the  algorithm 
including  the  prony  method  and  Newton's  method  for  finding  roots  along 
with  the  adaptations  is  unstable.  The  reasons  for  this  instability  lie  in 
the  problem  itself,  namely  that  if  two  frequencies  occur  in  a  certain 
combination  and  if  there  is  noise  present,  the  calculated  values  can  be 
degraded  to  the  extent  that  computed  values  are  unacceptable.  In  spite  of 
this  undesirable  aspect  of  the  problem,  the  case  for  p=2  is  probably  best 
since  an  iterative  technique  is  not  necessary  for  computation  of  roots. 
Perturbations  of  the  diagonal  of  the  matrix  A  of  the  order  of  1%  are 
allowable,  and  in  some  cases,  even  larger  perturbations  can  be  tolerated. 
Experiments  with  higher  order  systems,  say  order  5  or  so  have  shown  an 
extreme  sensitivity  to  noise,  and  perturbations  of  the  diagonal  on  the 
order  of  about  .1%  are  near  the  intolerance  level. 

The  algorithm  represented  here  is  an  improvement  over  the  original 
(see  (!]),  perhaps  mainly  due  to  the  technique  communicated  by  McCormiclc 
in  (61  regarding  the  quantization  of  the  values  prior  to  applying  the 
association  algorithm. 
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SIGNAL  PROCESSING  IN  EW  ENVIRONMENT 
By  John  Y.  Cheung 

ABSTRACT 

This  is  the  final  report  performed  for  the  contract  entitled  “Signal  Processing  in 
EW  Environment”.  Activities  related  to  this  contract  can  be  summarized  in  three  main 
aireas  as  outlined  in  the  proposal:  theoretical  development  an^^  simulation  of  the  direct 
adaptive  frequency  estimation  (DAFE)  algorithm,  theoretical  development  and  simulation 
of  the  effect  of  partially  received  signals  on  frequency  estimation  methods,  and  performance 
comparisons  through  simulation  of  various  frequency  estimation  methods.  These  three 
areas  are  conducted  by  the  the  principal  investigator,  a  visiting  scholar,  and  numerous 
graduate  students  throughout  the  contract  period. 
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SIGNAL  PROCESSING  IN  EW  ENVIRONMENT 


By  John  Y.  Cheung 


1.  Introduction 

This  is  the  final  report  on  the  work  performed  related  to  the  contract  entitled  “Signal 
Processing  in  EW  Environment."  The  application  of  signal  processing  techniques  in  detec¬ 
tion  and  estimation  of  the  spectral  contents,  particularly  the  harmonic  components  of  the 
received  signal  is  an  important  process  in  the  defense  related  environments.  The  problem  is 
further  complicated  by  many  additional  constraints  in  the  electronic  warfare  (EW)  arena. 
The  purpose  of  the  present  contract  is  to  analyze  current  frequency  estimation  techniques 
and  to  develop  new  techniques  that  are  applicable  for  EW  environments. 

There  are  many  constraints  in  the  EW  environments.  Received  signals  are  often  com¬ 
plex,  of  short  durations,  of  relatively  high  signal  to  noise  ratio,  overlapping,  of  varying 
types,  and  of  different  origins.  Complex  signals  are  often  obtained  thus  providing  both 
magnitude  and  phase  characteristics.  Though  the  signal  received  is  of  relatively  high  signal 
to  noise  ratio,  processing  is  extrememly  difficult  becaxise  the  signal  duration  is  extremely 
short.  Additional  factors  further  complicates  the  effectiveness  of  many  conventional  tech¬ 
niques.  For  example,  stationary  conditions  are  often  violated  because  there  are  many 
overlapping  signals  beginning  and  ending  within  the  same  block  of  sampled  data.  The 
sampled  data  often  include  different  types  of  signals  and  signals  from  many  different  di¬ 
rections. 

There  are  a  number  of  conventional  techniques  currently  proposed  for  frequency  es- 
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timation  luch  aa  Prony's  method  and  other  spectral  estimation  methods.  Most  of  these 
techniques  require  a  tremendoxis  amount  of  computations  and  are  of  less  practical  value 
in  EW  environments.  Furthermore,  due  to  the  above  mentioned  constraints,  many  con¬ 
ventional  techniques  cannot  be  applied  when  the  stationary  condition  is  violated.  The 
concept  of  adaptive  techniques  for  spectral  estimation  has  been  under  investigation  for  the 
last  two  decades.  The  goal  of  the  present  research  is  to  adapt  and  modify  the  adaptive 
techniques  for  spectral  estimation  for  \ise  speciBcally  in  frequency  detection  and  estimation 
environments. 

The  primary  area  of  investigation  during  the  present  contract  period  is  frequency 
estimation.  However,  the  research  effort  has  been  concentrated  in  three  directions: 

1.  The  development  of  a  new  adaptive  frequency  estimation  method  for  complex 
signals 

2.  An  evaluation  of  performance  degradation  in  partially  received  signals 

3.  A  comparison  of  performance  for  various  frequency  estimation  methods 
Each  of  these  three  areas  are  now  discussed  in  more  details  in  the  following  sections. 
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3.  The  DAFE  algorithm 


There  are  a  number  of  conventional  frequency  estimation  techniques.  These  can  be 
categorized  into  two  main  categories; 

1.  Direct  frequency  estimation  methods  such  as  Prony's  method 

2.  Spectral  methods  by  means  of  spectral  estimation 

Direct  frequency  estimation  methods  uses  the  original  signal  to  directly  calculate  the  mag¬ 
nitudes  and  phases  of  any  frequency  components  present  in  the  signal.  No  intermediate 
parameters  are  used.  Methods  in  this  category  include  Prony’s  method  and  its  variants. 
In  a  somewhat  indirect  way,  some  methods  use  the  eigenanalysis  of  the  covariance  ma¬ 
trix  to  estimate  the  frequency  contents  of  the  received  signal.  The  second  category  of 
frequency  estimation  methods  are  actually  adapted  from  spectral  estimation  methods.  Af¬ 
ter  the  spectrum  is  estimated,  a  threshhold  is  used  to  identify  the  peaks.  The  locations 
and  heights  of  the  peaks  correspond  to  the  frequencies  and  magnitudes  of  the  harmonic 
components. 

Several  problems  exist  with  these  conventional  techniques.  First,  Prony’s  method 
requires  a  procedure  for  finding  the  roots  of  a  polynomial.  The  order  of  this  characteristic 
polynomial  is  determined  by  the  number  of  signals  present.  Rooting  a  polynomial  is  an 
open  ended  problem.  It  is  not  a  particularly  easy  task  and  requires  a  fair  amount  of 
computations.  Secondly,  Prony’s  method  requires  prior  knowledge  of  the  order  of  the 
system,  i.e.  the  number  of  frequencies  present.  Incorrect  order  gives  grossly  inaccurate 
estimation  results.  If  the  order  is  unknown  a  priori,  then  Prony’s  method  mxist  be  solved 
for  each  order.  A  selection  criterion  is  used  to  choose  the  proper  order. 
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For  spectr&l  estimation  methods,  the  amount  of  computation  b  abo  extensive.  Most 
spectral  methods  require  the  computation  of  the  covariance  matrix,  from  which  the  auto¬ 
regressive  and  moving  average  (ARMA)  coefficients  are  estimated.  (Depending  on  the 
algorithms,  sometimes  may  be  strictly  AR  or  MA  coefficients  that  are  estimated.)  Some 
methods  only  yield  the  auto-regressive  coefficients  while  others  give  both  sets  of  coefficients. 
After  the  coefficients  are  obtained,  a  Fast  Fourier  transform  b  \isually  used  to  generate  an 
estimate  of  the  power  spectral  density  from  which  the  frequency  peaks  are  identified  and 
located.  Even  though  many  iterative  algorithms  can  be  used  to  solve  for  a  set  of  linear 
equations,  the  whole  process  b  still  quite  time  consuming.  Furthermore,  the  magnitude 
estimate  b  quite  imreliable. 

In  view  of  the  above  limitations  of  conventional  methods,  the  present  research  uses 
an  adaptive  approach  for  frequency  estimation.  The  adaptive  approach  has  several  ad¬ 
vantages.  First,  an  adaptive  approach  can  be  more  effectively  applied  when  the  received 
signal  b  not  stationary  but  b  slowly  varying.  An  adaptive  approach  can  not  only  detect 
the  presence  or  absence  of  a  particular  frequency  component,  but  also  track  signals  with 
slowly  varying  frequencies.  Thb  b  the  basic  motivation  for  developing  the  direct  adap¬ 
tive  frequency  estimation  (DAFE)  method  to  estimate  the  magnitude  and  phase  of  any 
frequency  components. 

The  DAFE  algorithm  b  effective  lor  detecting  and  estimating  the  frequency  and  cor¬ 
responding  complex  (or  real)  amplitudes  from  complex  (or  real)  input  signals.  There  are 
no  theoretical  limits  as  to  the  total  number  of  frequencies  present  in  the  signal.  There 
may  be  a  practical  limit  depending  on  the  availability  of  hardware  and  computation  time. 
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The  DAFE  algorithm  requires  simple  computations  and  many  of  the  calculation  can  be 
performed  in  parallel  further  speeding  up  the  total  process.  In  addition  to  applying  the 
DAFE  algorithm  in  parameter  estimation  problem  where  the  model  order  is  known,  the 
algorithm  can  also  be  applied  in  system  identification  environments  where  the  model  order 
is  not  known. 

There  are  two  versions  of  the  DAFE  algorithm.  The  first  and  simplest  one  of  the 
two  makes  use  the  least  means  square  (LMS)  adaptive  algorithm.  The  second  and  more 
complex  one  makes  use  of  the  recursive  least  square  (RLS)  technique.  The  basic  principle 
of  both  versions  is  the  same.  The  steps  can  be  itemized  as  follows: 

1.  From  the  original  signal,  tise  the  property  of  a  sine  wave  to  successively  generate 
intermediate  signals  each  with  one  sintisoid  removed.  If  all  sinusoids  have  been 
removed,  the  signal  must  be  the  noise  sequence  with  all  sinusoidal  components 
removed. 

2.  Use  the  error  signal  to  adaptively  change  the  frequency  estimates  in  such  a  wa^- 
that  the  magnitude  of  the  error  signal  is  minimized. 

3.  Use  the  error  signal  at  the  same  time  to  adaptively  estimate  the  complex  ampli¬ 
tudes  of  each  corresponding  frequency  components. 

4.  Repeat  steps  1  through  3  for  each  new  data  point  received. 

The  detailed  analytical  development  of  the  LMS  version  of  the  DAFE  algorithm  is 
given  in  two  papers.  The  first  paper  entitled  “A  Direct  adaptive  frequency  estimation 
technique"  contains  preliminary  information  and  has  been  presented  in  the  30^^  Midwest 
Symposiiim  on  Circuits  and  Systems  in  Syracuse,  NY  on  August  17-18,  1987.  The  paper 
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18  &1bo  published  in  the  conference  proceedings.  A  reprint  of  this  paper  has  been  included 
in  this  report  as  Appendix  A.  A  second  paper  entitled  “A  structured  adaptive  frequency 
estimation  technique”  has  been  prepared  and  submitted  to  the  IEEE  IVansactions  on 
Acoustics,  Speech,  and  Signal  Processing.  The  second  paper  b  more  complete  in  the 
analytical  developments  and  b  included  in  thb  report  as  Appendix  B. 

The  first  two  papers  only  include  the  development  of  the  DAFE  algorithm  using  the 
LMS  method.  A  third  paper  b  being  prepared  at  this  point  containing  the  analytical 
development  and  simulation  results  using  the  RLS  algorithm.  Proper  acknowledgment  to 
the  support  of  thb  contract  will  be  given  when  the  third  paper  b  submitted  as  with  the 
first  two  papers.  The  use  of  the  RLS  method  for  adaptation  reduces  the  adaptation  time 
by  one  half.  Thb  reduction  in  adaptation  time  b  expected. 

Estimating  Frequencies  and  Real  Amplitudes  bv  The  RLS  Algorithm 

Assume  the  input  signal  b  real  and  b  represented  as  follows: 

M 

=  (2-1) 

i=i 

where  M  b  the  number  of  dbcrete  frequencies,  ay’s  are  the  real-valued  amplitudes,  and 
uy(n)’s  are  the  individual  sinusoids,  i.e., 

uy(n)  =  Bin(u;yn)  (2-2) 

Using  the  property  of  a  sine  wave,  we  have  sin  wn  —  w  sin  tu(n  —  l)  -I-  sir.  u)(n  —  2)  =0.  Let 
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ua  then  define  the  following: 


Xi(n)  =x(n)  -  u)iz(n  —  l)  +  z(n  —  2) 
i2 (n)  =xi  (n)  -  tDjXi (n  -  1)  -t-  xi  (n  -  2) 

Z3(n)  =i2(n)  -  tU3i2(n  -  1)  +  X2(^  -  2)  (2-3) 

Z4(n)  =i3(n)  -  W4Xz{n  —  1)  +  i3(n  -  2) 


XM{n)  =XM-i{n)  -  -  1)  +  -  2)  (2-4) 

In  2-2,  the  first  operation  effectively  removes  the  sinusoidal  component  of  the  sec¬ 
ond  removes  the  next  sintisoidal  component  of  u/2,  etc.  Since  there  aie  only  M  discrete 
frequencies  present,  iM(n)  must  contain  noise  alone  because  all  M  sinusoids  have  been 
removed.  This  error  can  be  expressed  in  a  slightly  different  form  as  follows: 

e(n)  =  XM(n)  =  d(n)  -  A:^(n)lV^(n)  (2-5) 

where: 

d(n)  =  z(n)  -t-  i(n  -  2)  +  ii(n  -  2)  +  Z2(n  -  2)  -K  .  •  .XM_i(n  -  2) 

■^Adr(")  =  [^(«- 1)  xi{n-l)  X2(n-1)  ...  ZM-i(n-l)] 

I 

1 

I 

^M(n)  =  [t&i  U>2  It's  ...  Wm] 

The  weighted  squared  error  can  immediately  be  found  by  squaring  the  expression  for  the 
error  above: 

£(n)  =  ^r-V(0  (2-6) 

t=l 
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In  the  gradient  search  method,  the  minimum  error  location  can  be  found  by  setting 
=  to  Kro,  then 

(2-7) 

where 

n 

RMM{n)  =^A"-‘Xm(0-?^m(0 

»S=1 

n 

P„(n)=53A”-‘<i(>-)XM(.) 

«S=1 

The  above  matrix  equation  gives  the  expression  for  the  optimal  weights: 


w'mW  =  -RmImW  ■?«(") 


(2-8) 


But  to  get  an  estimate  of  the  weights,  we  must  know  the  inverse  of  the  covariance  matrix 

observe  from  the  definition  that  can  be  written  as: 

RmmI^)  =  A.RAfA/{»  ~  1)  +  ^M{^)X^{n)  (2-9) 


Using  the  matrix  inverse  lemma,  a  recursive  update  for  ^  terms  of  the  previous 

inveror ,  .ffjj^^(n  —  1),  and  the  new  data  acquired  at  time  n  can  be  obtained: 


^AfM(”)  “  J 


p-l  (”  (>^^) (^) R-MM (^  ~  ^) 

.  ^  X^Xl{n)Rll^[n-l)X^[n)  , 


(2-10) 


Define  GmM,  CuMin),  g(n)  as  below: 


GMM(n)  — 

^  -  1)^m(»») 

=  - T"; — T—{ - 

A  +  g(n) 


(2-11) 

(2-12) 
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,(n)  =  -  I)X«(n) 


(2-13) 


Now  (2-8)  can  be  rewritten  as: 


“  1)  -  GM{n)Xlf{n)CMM{n  -  1)] 


(2-M) 


Likewise  Pm{^)  ®'Iso  be  expanded  as: 


PMin)  =  XPj^ln  -  1)  +  d(n)XM(n) 


(2-15) 


Substituting  (2-11),  (2-12),  (2-13),  and  (2-14)  into  (2-9),  we  obtain  the  following  expres¬ 
sion  for  Wjif{n): 


^M{n)  -  1)  --  GM{n)X‘][f[n)CMM{n  -  l)]l^-PM(n  -  l)  +  <i(’i)-X’M(’i)] 


=W'M(n-l)  +  GM(n)XM(n) 


(2-16) 


From  (2-i5),  the  frequencies  fj  can  readily  be  obtained: 


fi  =  (cos  *  Y)/(27r) 


(2-17) 


Now  that  we  have  found  the  frequencies  embedded  in  the  input  signal;  we  next  turn 
our  attention  to  finding  the  real-valued  amplitudes  of  these  frequencies.  Rewriting  (2-1), 


«;•(«)  «y(^) 


(2-18) 


We  have  an  estimate  of  uy(n)  using  (2-2)  and  Wj  in  (2-15).  We  now  need  to  estimate 
ay.  Let  the  error  of  estimating  the  amplitudes  be  defined  as: 


*(")  =  *(")  - 


(2-19) 


where: 


=(ai(n),  a2(n),  ...,  om(^)] 


Defining  the  squared  error  as: 

=  (2-20) 
i=l 

We  will  now  use  the  gradient  search  method  again  to  obtain  an  estimate  of  the  amplitudes: 


d£(n) 

dAM(n} 


=  &«(.■) 

i=l 


=  0 


(2-21) 


and 


AM(n)  = 


(2-22) 


Where 

t=l 

and 

n 

?«(")= L*"‘‘‘'i-)^"(*') 

»=i 

Following  the  same  developments  as  the  frequency  estimation  case,  we  again  use  the  same 
matrix  lemma  to  obtain  an  RLS  solution  to  the  amplitude  estimation. 


-  J 


-RmmI")  =  -  1)  +  (2-23) 

-  1)  -  -  1)1 
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Define 


(2-25) 

7?  Guuin-  l)UM(n) 

(2-26) 

-  l)^M(n) 

(2-27) 

Substituting  (2-25),  (2-26),  and  (2-27)  into  (2-24)  and  amplifying,  we  obtain: 

C'mm(«)  =  ~  1)  ~  GM{'^)Ulf[n)C MM{n  -  1)  (2-28) 

We  now  have  an  estimate  of  the  amplidue  by  putting  (2-28)  into  (2-22): 

(2-29) 

or 

=  Auin  -  1)  +  GM{n)e(n)  (2-30) 

Estimatine  frequencies  and  complex  amplitudes  for  complex  signal  bv  the  RLS  algorithm 
In  the  previous  case,  we  have  shown  how  the  RLS  algorithm  can  be  used  to  estimate 
real-valued  frequencies  and  real-valued  amplitudes  from  real-valued  input  data.  In  this 
section,  we  will  further  extend  the  RLS  algorithm  for  complex  input  data  to  obtain  real¬ 
valued  frequencies  and  complex  valued  amplitudes.  The  basic  algorithm  is  the  same  as 
the  previous  case  in  principle  except  caution  must  be  exercised  to  distinguish  real-valued 
parameters  from  complex  parameters. 

To  represent  the  complex  input  data,  we  will  use  the  complex  exponential  format 
rather  than  the  simple  sine  function  format: 

M 

=  (2-31) 
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The  removal  of  a  complex  exponential  is  accomplished  by  the  same  operation.  We  define 
the  following: 


xi (n)  =x(n)  -  ufix{n  ~  1)  +  i(n  -  2)  (2-32) 

X2(n)  =ii(n)  -W2Xi{n  -  1)  -c  2i(n  -  2) 


iA/(n)  =XA/_i(n)  -  -  l)  +  XM-i(n  -  2)  (2-33) 

If  there  are  only  M  complex  exponentials,  the  nzM(n)  would  represent  the  noise  signal 
after  all  M  sin\isoids  have  been  removed.  This  error  signal  can  now  be  rewritten  in  matrix 
format: 

eW  =  XM(n)  =  d(n)  -  Xlt[n)WM{n)  (2-34) 

where 


d(n)  =i(n)  -|-x(n-2)  -l-Xi(n-2)  +X2{n-2)  +  . . . -h  XM-i(n  -  2) 

■^5r(n)  =  [x(n  -  1)  xi(n-l)  X2(n  -  1)  ...  XM-i{n-l)] 

^M{n)  =  \u)x  W2  tl»3  ...  Wm]^ 

The  squared  error  for  the  frequency  estimation  is  real-valued  and  can  be  expressed 
as  follows: 

n 

=  (■>(•)  (2-35) 

»=l 

The  asterisk  indicates  a  complex  conjugate  operation.  The  gradient  of  the  squared  error 
in  terms  of  the  frequency  vector  can  be  found  by  differentiating  the  above  equation. 
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Setting  the  gradient  to  zero,  we  have  the  optimal  solution  for  the  weight  vector: 

^  ^  +  e(.)Ar;;,(.)l  =  o  (^36) 

or 

(2-37) 

where 

SmmW  =  X;a“-|Xm(ox;J(0  +  yjuwxJwi 

i-l 

P„(n)  =  f^A”-lXM(i)i-(0  + 

tsl 

Note  that  i®  ^  real-valued  and  symmetric  covariance  matrix.  We  want  to 

evaluate  in  a  recursive  manner.  Rewriting  the  above  definition  for  in 

an  iterative  manner,  we  obtain: 


RMM{n)  =  -  1)  +  XM[n)XM  (n)  +  Xlf[n)Xlf{n) 


(2-38) 

(2-39) 


Since  there  are  three  terms  on  the  right  hand  side  in  (2-38),  the  matrix  inversion  formula 
must  be  applied  twice.  In  other  words,  we  first  find  the  inverse  of  terms  of 

~  ^)’  inverse  of  ~  1)  in  terms  of  —  !)•  First,  we 

will  find  •RJ[^^(n)  in  terms  of  R^j^in  —  l)  using  (2-39): 


Rm^mI”)  -  RmmI^  “  1)  "  1  4.  VT  P'-I 

If  we  now  define  the  parameters  as  follows: 


(2-40) 


CMM{n)  =RKf\ 


(2-41) 
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q\n)  — —  l)XX/(n) 

- TT7W) - 

then 

^MM(n)  =C'i^f^(n  -  1)  -  G'j^{n)X^{n)CMf^{n  -  1) 


Next  we  need  to  find  the  inverse  of  RMM{n  —  l)  in  terms  of  1)- 

and  (2-39),  we  have  the  expression  for  *M«("  -  1): 


PR'umI^  -  1)  =-'KMM(n  -  1)  +  Xu{n]X-J (n) 

and 


The  iterative  expression  for 


>'-i 

‘•MM 


(n  —  1)  is: 


•Rmm(»»  -  1) 


X  ^  X  X  +  X;J{n)R^^^{n-l)Xj^{n) 


If  we  now  further  define  additional  parameters  as  follows: 


q(n)  =  XjJ -  1)-X’m(»») 
_  C’mm(«  -  l)XM{n) 

-  — rr^w — 

Then  —  1)  in  (2-44)  can  be  expressed  as  follows: 


-  1)  =  \\CMM{n  -  1)  -  G«(n)Arj^ (n)C„M(n  -  1)] 


(2-42) 
From  (2-38) 

(2-13) 

(2-44) 

(2-45) 


(2-46) 


Substituting  (2-42)  and  (2-46)  into  (2-37),  we  obtain  an  iterative  expression  for  the  fre¬ 
quency  vector: 

WM{n)  = 

-  ~  1)  ~  ~  ~  1) 

=  -  1)  +  Gi^{n)AM(n)  {2-47; 

But  from  (2-46)  we  have: 

w;^(n  -  1)  =  Cj^j^(n  -  l)Pj^(n  -  l) 

=  -  1)  “  Gm{ti)X}J (n)CA/A/(n  -  l)](APA/(n  -  1)  +  XM[n)d’ [n)] 

=  W^M(n  -  1)  +  GAf(n)Xj(^(n)  (2-48) 

Substituting  (2-48)  into  (2-47),  we  obtain  the  final  expression  for  the  optimal  weight 
vector: 

WM{n)  =  WM{n  -  1)  +  GM{n)X;^[n)  +  G'j^{n)XM{n)  (2-49) 

Close  examination  of  the  above  equation  shows  that  the  last  two  terms  are  real-valued 
vectors  thus  always  making  the  Wj^{n)  vector  real-valued.  The  elements  of  H^M(n)  vector 
are  exactly  the  frequencies  of  the  complex  sinusoids. 

Next,  we  proceed  to  estimate  the  complex  amplitudes  by  the  same  RLS  algorithm. 
The  derivation  parallels  that  of  the  real-valued  case  developed  in  the  previous  section.  Let 
the  input  and  the  estimation  signal  be  given  as  follows: 

M 

z(n)  =  (2-50) 
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(2-61) 


*(’>)  =  ((«(•>) 


The  estimation  error  is: 


e(n)  =  x(n)  - 


(2-52) 


and  the  squared  error  is: 


(2-53) 


Since  the  amplitudes  are  complex,  we  must  treat  the  real  and  imaginary  part  separately. 
The  gradient  of  the  real-valued  square  error  with  respect  to  the  real  and  imaginary  past 
of  the  complex  amplitudes  are: 


c-(.-)(-&„(.))|  (2-54a) 

oAR[n) 

e-(.l(;^M(.-))l  (2-54b) 


Combining  the  real  and  imaginary  parts,  we  have: 


^  =  Ea-MO&mC.-) 

A#  (»i)  SAr  dAj 


(2-55) 


The  optimal  estimation  for  A  is  found  by  setting  the  above  gradient  to  zero.  This  results 


in  the  following: 


^  X'-'ldii)  -  .4j,(n)&„(0l^;:,(.-)  =  0 


(2-56) 


a4m(w)  =  RMM{^)PM{n) 


(2-57) 


where 
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tsl 

Using  the  same  principles  as  before  to  arrive  at  an  iterative  solution,  can  be 

rewritten  as: 

=  A^A/A/{n  —  1)  +  (2-58) 


PM[n)  =  APM(n  -  1)  ^M(n)d(n) 


(2-59) 


The 


inverse  of  RMM{n)  can  be  found  in  similar  fashion: 


R-^ 


-R-'  (n  1)  ^ 


(2-60) 


where 


9(n)  -^5(n)PMM(n-l)C^M(n) 

- aTIH - 

and 

CA/M(n)  =-RaA/{”) 

then 

Ca/m {^)  =  jlCMM(n  -  1)  -  GM(’i)U^(n)CMM(n  -  1)]  (2-61) 

Substituting  (2-61)  into  (2-57),  we  obtain  an  expression  for  the  complex  amplitudes: 

=  jpMM(n  -  1)  -  5M(n)u5(n)C„M(n  -  l)l|A?M(n  -  1)  +  C>M(n)<i(>.)i 
=  AM{n-l) +GM{n)  ■  e(n)  (2-62) 


Simulation  Results 


The  DAFE  algorithms  tising  the  LMS  or  RLS  method  has  been  simulated  on  the 
IBM  3081  computer  using  single  precision  arithmetic.  For  the  present  report,  we  only 
include  the  results  of  a  single  test  case  based  on  real-valued  data.  Other  simulation  runs 
nevertheless  have  shown  that  indeed  the  algorithm  does  converge  correctly  for  complex 
input  data  when  estimating  the  real-valued  frequencies  end  complex  amplitudes. 

Figure  2.1  shows  the  learning  curve  for  the  DAFE-LMS  and  DAFE-RLS  methods  in  a 
sample  individual  run.  Note  that  though  the  DAFE-LMS  curve  seemingly  has  a  lower  error 
magnitude  at  the  beginning,  the  convergence  rate  is  nevertheless  very  6!^w  at  the  end.  For 
this  particular  simulation  run,  the  DAFE-LMS  algorithm  converges  rather  rapidly  after 
500  iterations  rotate  the  DAFE-RLS  algorithm  takes  twice  as  many  interations.  Figure  2.2 
and  2.3  shows  the  trajectories  of  frequency  estimations  of  all  fo\ir  frequencies  as  a  function 
of  the  number  of  points  for  the  DAFE-LMS  and  DAFE-RLS  algorthims,  respectively. 
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Figure  2.1  Learning  curve  for  the  DAFE— LMS  and  DAFE— RLS  algorithms 


S.  Effects  of  partially  received  signals 

The  second  aspect  of  this  contract  b  to  investigate  the  effects  of  partially  received 
signab  on  various  frequency  estimation  methods.  A  partially  received  signal  b  a  short 
signal  that  begins  or  ends  during  a  block  of  sampled  data,  i.e.  within  a  sampling  window. 
In  other  words,  because  the  signal  duration  b  so  short,  some  signab  would  always  start 
during  the  sampling  window  while  others  would  always  end  during  the  sampling  window. 
In  real  life  situations,  it  may  not  often  be  possible  to  capture  a  signal  that  lasts  longer 
than  the  sampling  window.  Even  then,  other  signals  present  would  often  not  be  present 
for  the  whole  sampling  window  unless  the  sampling  window  b  trivially  short. 

The  effect  of  partially  receiving  a  signal  during  the  sampling  window  b  equivalent  to 
windowing  the  fully  received  signal  by  a  small  rectangular  window.  In  the  periodogram, 
the  windowed  signal  has  the  effect  of  convolving  the  spectral  estimate  with  the  transform 
of  the  windowing  function.  The  result  of  thb  convolution  b  a  loss  of  frequency  resolution. 
Thb  effect  should  be  the  same  for  the  other  frequency  estimation  methods. 

Detailed  theoretical  analyses  for  the  periodogram  and  the  modified  covariance  method 
b  given  in  the  master  thesb  of  Mr.  Gopinath  Benigeri  who  has  also  carried  out  the 
simulation  of  partially  received  signab  for  different  frequency  estimation  methods. 

A  p  percent  window  b  defined  as  follows.  Let  there  be  M  points  in  the  window 
function  with  non-zero  values  such  that 


^  X  100%  =  p% 
N 


For  example  in  an  80%  window  (p  =  80%)  we  have 


—  X  100  or  M  =  — - 
N  10 


4N 

5 
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so  there  are  ^  value*  in  a  rectangular  window  of  width  N  with  values  =  1  and  the  rest 


are  tero. 

The  rectangular  data  window  is  defined  as 

,  ^  f  1  l<n<N-l 
win)  =  <  ,  , 

1 0  elsewhere 

The  fourier  transform  of  the  rectangtilar  data  window  u;(n)  is 

OO 

W{f)  =  ^ 

«=— OO 

stn  tt/ 

W{f)  is  known  as  the  Dirichlet  kernel. 


(3-1) 


(3-2) 


Equivalent  Noise  Bandwidth: 

The  equivalent  noise  bandwidth  of  a  window  is  the  width  of  a  rectangular  window  with 
peak  signal  power  gain  that  would  accumulate  the  same  noise  power.  The  accumulated 
noise  power  of  a  window  is  defined  by 

Noise  Power  =  /  l^(^)|*~»  w  =  2irf  (3-3) 

J-k/t  27r 

where  No  is  the  noise  power  per  unit  bandwidth.  Assuming  T  =  1,  and  using  Pareval’s 
Theorem,  we  get 

=  K  |t»(n)p  (3-4) 

ns^oo 

The  peak  signal  gain  and  the  peak  power  gain  both  occur  at  /  =  0.  Therefore,  the  peak 
signal  gain  W(o)  and  the  peak  power  gain  W^{o)  can  be  expressed  as  follows; 

OO 

^(0)  =  5^ 

n=*oo 
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Noise  Power 


r  oo 


w\0)  =  Y.  “I"! 

ro 

So  the  equivalent  noise  bandwidth  is  given  as  follows  after  normalizing  by  Npi 


ENBW  = 


Noise  Power 
Peak  Power  Gain 


E 

ns— 00 


(3-5) 


The  limits  of  n  are  assumed  to  be,  in  general,  from  — oo  to  oo.  When  u)(n)  is  substituted 
by  a  particular  window,  the  values  of  n  are  bounded  by  the  region  where  the  window  is 
defined. 

Let  MS  assume  that  we  have  a  p%  rectangular  window.  The  p%  rectangulair  window 
is  defined  as 

n  0  <  n  <  A/  -  1 

(3-6) 


'  '  lo  Af  <n<  J^-1 


The  equivalent  noise  bandwidth  for  this  window  'is  as  follows: 


M-l 


ENBW[M)  =  W^[n)J  ^  W[n) 


n=C 


M-1 


'>»=0 


_  J_ 
“  M 


(3-7) 


The  E  NBW  is  now  seen  to  be  a  function  of  the  width  M  of  the  window. 

Scalloping  Loss: 

Let  us  assume  that  we  have  a  signal  consisting  of  a  complex  sinusoid  and  additive 
noise.  The  signal  can  be  represented  as  follows: 


/(n) 


(3-8) 
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where  ^(n)  is  a  Gaussi&n  noise  process  with  vaj-iance  The  Fourier  Transform  of  the 
signal  component  of  the  input  after  being  processed  through  a  window  is  given  by  the 
matched  filter  output; 


s-i 

Fiwk)  = 

n=0 

iV-1 

=  *"(»»)  (3-9) 

nsO 

This  is  also  called  the  coherent  gain.  Here  we  have  implicitly  assumed  that  the  signal 
frequency  w*  is  a  multiple  of  the  sampling  frequency  If  the  signtd  frequency  Wk  lies 
midway  between  the  two  adjacent  sampling  frequency  multiplies,  i.e. 


u'*  =  {^  +  where  tw,  =  2»r/, 

Then  the  change  in  the  coherent  gain  due  to  this  half  bin  shift  is 

s-i 

JCwi/s)  *  A  ^  where  u/j/j  =  ^ 


or 


/f~i 


/•(<»./!)  =  .4  ^  wln)r’i’ 


(3-10) 


nsO 


The  scalloping  loss,  which  is  defined  as  the  ratio  of  the  coherent  gain  for  a  signal  frequency 


located  at  half  a  bin  away  to  that  located  at  the  DPT  bin,  is 


Scalloping  loss  = 


E  “'W 

fi=0 


(3-11) 


For  a  rectangular  window,  which  is  p%,  the  scalloping  loss  will  be  a  function  of  M.  It  is 
given  by 


M-l 

I  E  «■'*“! 

Scalloping  Loss  = 


M 


(3-12) 
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Simulation  Results 

The  effect  of  partially  received  signals  has  been  simulated  in  FORTRAN  on  the  IBM 
3081  for  three  methods: 

1.  The  Burg  Method 

2.  The  Modified  Covariance  Method 

3.  The  Adaptive  RLS  Method 

For  each  method,  we  have  simulated  the  data  for  a  signal  window  of  100%,  80%,  60%,  40%, 
and  20%.  Figure  3.1  shows  the  effect  of  diminishing  window  size  for  the  Burg  Method. 
Figure  3.2  and  3.3  shows  the  same  for  the  modified  covariance  method  and  the  adaptive 
RLS  method  respectively. 
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4.  Perforzziance  comparUon  of  frequency  estimation  methods 

The  third  aspect  of  the  present  research  effort  is  to  compare  the  performance  of  various 
frequency  estimation  methods.  The  methods  chosen  for  this  purpose  include: 

1.  The  Periodogram 

2.  The  Modified  Covariance  Method 

3.  The  Burg  Method 

4.  The  Adaptive  RLS  Method 

5.  Prony’s  Method 

6.  The  SVD  Method 

Among  these  six  methods,  the  periodogram  serves  as  a  control  for  comparison  with  other 
methods.  The  modified  covariance  method  is  based  on  the  input  covariance  matrix.  Au¬ 
toregressive  coefficients  are  estimated  directly  from  the  covariance  matrix  and  the  power 
spectral  density  function  is  computed  from  the  estimated  autoregressive  coefficients.  The 
Burg  Method  gives  an  estimate  of  the  power  spectral  density  using  an  estimate  of  the 
reflection  coefficients.  In  the  adaptive  RLS  Method,  the  autoregressive  coefficients  are 
computed  directly  from  the  input  signal  in  an  iterative  manner  whenever  a  new  sample 
point  is  available.  In  Prony's  Method,  the  input  data  is  reduced  into  a  set  of  polynomi- 
nal  coefficients.  The  roots  of  the  characteristic  polynomial  correspond  to  the  frequencies 
of  the  signal.  In  the  signular  value  decomposition  (SVD)  method,  the  eigenvalues  of  the 
covariance  matrix  are  related  to  the  desired  frequencies. 

To  test  the  effectiveness  of  all  these  six  methods,  two  standard  sets  of  test  data  are 
generated  and  applied  to  these  methods.  In  the  first  test,  a  single  frequency  embedded 
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Figure  3.2  Effect  of  diminishing  percentage  of  received  signal  in  the  ModiSed  Covariance  Klethod 
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in  additive  Gavmian  noise  for  testing  the  performance  of  the  methods  including  the  bias 
and  the  mean  square  error  of  frequency  and  amplitude  estimation.  In  the  second  test, 
two  frequencies  are  present  in  the  input  signal.  These  two  frequencies  vary  in  frequency 
independently  from  the  low  end  of  the  frequency  scale  to  the  upper  end  of  the  frequency 
scale.  The  purpose  of  this  test  is  to  explore  the  effects  of  one  signal  on  the  other. 


Test  One:  Single  Sinusoid  in  Noise 

In  the  first  test  case,  we  have  a  single  sinusoid  embedded  in  white  noise.  The  purpose 
of  this  test  is  to  examine  the  performance  of  the  six  methods  with  respect  to  various  signal 
to  noise  ratio.  Specifically,  we  will  examine  the  bias  and  mean  signal  error  of  the  frequency 
and  amplitude  estimations  of  these  six  methods. 

Figure  4.1  shows  the  bias  and  mean  squared  error  for  all  six  methods.  To  vi<;ualize  the 
effect  of  noise  level  on  the  bias  of  frequency  estimation,  we  have  plotted  the  frequency  bias 
as  a  function  of  signal  to  noise  ratio  (SNR).  These  plots  are  fhown  in  Figure  4.2.  From 
Figure  4.2,  it  is  evident  that  the  SVD  method  appears  to  perform  well  even  in  low  signal 
to  noise  ratio. 

From  Figure  4.2a,  we  see  that  the  bias  in  frequency  estimation  for  the  periodogram 
begins  to  increase  when  SNR  drops  below  10  DB.  For  the  modified  covariance  method, 
frequency  bias  is  minimized  xmtil  the  SNR  drops  below  5  DB.  The  same  is  observed  for  the 
Burg  Method.  Somewhat  inconsistent  results  are  observed  in  the  adaptive  RLS  method. 
This  inconsistency  is  due  mostly  because  a  short  data  sequence  (64  points)  is  used  so  that 
convergence  was  not  obtained.  The  Prony’s  Method  seems  to  have  minimum  frequency 
bias  until  SNR  drops  below  -5  DB.  The  same  is  observed  also  for  the  SVD  method. 
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SNR 

(db) 

Bias 

(Freq.) 

Bias 

(Amp.) 

Std.  of 
Mae 
(Freq.) 

Std.  of 
Mse 
(Amp.) 

C.R.B. 

(Freq.) 

C.R.B 

(Amp.) 

35.0 

.1053e-3 

.1099e-2 

.1953e-3 

.1254 

.9576e-5 

.7812e-2 

30.0 

.1953e-3 

.1257e-l 

.1953e-3 

.1339 

.1703*^ 

.7812e-2 

25.0 

.1953e-3 

.1907e-2 

.1953e-3 

.1246 

.302&e-4 

.7812e-2 

20.0 

.1953e-3 

.7266e-2 

.1953e-3 

.1112 

.5385e-4 

.7812e-2 

15.0 

.1270e-3 

.2793e-l 

.2797e-3 

.1074 

.9576e-3 

.7812e-2 

10.0 

.8786e-4 

.I752e-1 

.4212e-3 

.1340 

.1703€-3 

.7812e-2 

5.0 

.9589e-l 

.1117e+l 

.1386 

.1628 

.5385e-3 

.7812e-2 

0.0 

.1880 

.1187e-T  1 

.1939 

.1232 

.5385e-3 

.7812e-2 

-5.0 

.2000 

.6309 

.2000 

.6362 

.9576e-3 

.7812e-2 

-10.0 

.2000 

.2901 

.2000 

.3016 

.1703e-2 

.7812e-2 

The  Periodogram 

SNR 

Bias 

Bias 

Std.  of 
Mse 

Std.  of 
Mse 

C.R.B. 

C.R.B. 

(db) 

(Freq.) 

(Amp.) 

(Freq.) 

(Amp.) 

(Freq.) 

(Amp.) 

35.0 

,5176e-3 

.7553 

.7112e-2 

.7371e-|-l 

.9576e-5 

.7812e-2 

30.0 

.1953e-3 

.1317e-l 

.1953e-3 

.1135 

.1703e-4 

.7812e-2 

25.0 

.1953e-3 

.1605e-l 

.1953e-3 

.1223 

.3028e-4 

.7812e-2 

20.0 

.1953e-3 

.2171e-l 

.2522e-3 

.12% 

.5385e-4 

.7812e-2 

15.0 

.1464e-3 

.2431e-l 

.2584e-3 

.9956e-l 

.9576e-4 

.7812e-2 

10.0 

.4882e-4 

.6416e-2 

.3782e-3 

.1289 

.1703e-3 

.7812e-2 

5.0 

.6837e-4 

.17720-1 

.63l4e-3 

.1280 

.5385e-3 

.7812e-2 

-5.0 

.1330 

.3976 

.1630 

.5214 

.9576e-3 

.7812e-2 

-10.0 

.1940 

.2626 

.1970 

.2847 

.1703e-2 

.7812e-2 

b)  The  Modified  Covariance  Method 


Figure  4.1  Bias  and  mean  square  error  of  frequency  and  amplitude  estimation  for  all  six  methods 
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SNR 

Bias 

Bias 

Std.  of 
Mae 

Std.  of 
Mae 

C.R.B. 

C.R.B 

(db) 

(Freq.) 

(Amp.) 

(Freq.) 

(Amp.) 

(Freq.) 

(Amp.) 

35.0 

.1953e-3 

.1381e-l 

.1953e-3 

.1272 

.9576e-5 

.7812e-2 

30.0 

.1953e-3 

.2229e-l 

.1953e-3 

.1299 

.l703e-4 

.7812e-2 

25.0 

.1885e-3 

.1913e-l 

.2095e-3 

.1172 

.3028e-4 

.7812e-2 

20.0 

.3904e-4 

.1812e-l 

.3601e-3 

.1250 

.5385e-4 

.7812e-2 

15.0 

.4882e-4 

.1326e-l 

.5074e-3 

.1243 

.9576e-3 

.7812e-2 

10.0 

.0000 

.1520e-l 

.9468e-3 

.1295 

.l703e-3 

.7812e-2 

5.0 

.3910e-4 

.1724e-l 

.1237e-2 

.1273 

.3028e-3 

.7812e-2 

0.0 

.8223e-2 

.5083e-l 

.4004e-l 

.2769 

.5385e-3 

.7812e-2 

-5.0 

.1442 

.4353 

.1697 

.5466 

.9576e-3 

.7812e-2 

-10.0 

.1980 

.2919 

.1990 

.3077 

.1703e-2 

.7812e-2 

c) 

The  Burg  Method 

SNR 

Bias 

Bias 

Std.  of 
Mae 

Std.  of 
Mae 

C.R.B. 

C.R.B 

(db) 

(IVeq.) 

(Amp.) 

(Freq.) 

(Amp.) 

(Freq.) 

(Amp. 

35.0 

.1953e-3 

.6601e-l 

.1953e-3 

.1284 

.9576e-5 

.7812e-2 

30.0 

.1953e-3 

.1492e-l 

,1953e-3 

.1280 

.1703e-4 

.7812e-2 

25.0 

.2930e-3 

.1215e-l 

.4143e-3 

.1554 

.3028e-4 

.7812e-2 

20.0 

.8301e-2 

.1351e-l 

.2588e-l 

.4312e-l-l 

.5385e-4 

.7812e-2 

15.0 

.2363e-l 

.1507e+l 

.5302e-l 

.3314e+l 

.9576e-4 

.7812e-2 

10.0 

.2002e-l 

.3738 

.6325e-l 

.1351e-l-l 

.1703e-3 

.7812e-2 

5.0 

.1563e-l 

.1115e+l 

.1084 

.1495e-|-l 

.3028e-3 

.7812e-2 

0.0 

.1311 

.8527 

.1589 

.1045e-|-l 

.5385e-3 

.7812e-2 

-5.0 

.1868 

.5986 

.1910 

.6079 

.9576e-3 

.7812e-2 

-10.0 

.1333 

.2846 

.1770 

.2922 

.1703e-2 

.7812e-2 

d)  The  Adaptive  RLS  Method 


Figure  4.1  (Con’t.) 
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SNR 

Bias 

Bias 

Std.  of 
Mse 

Std.  of 
Mse 

C.R.B. 

C.R.B 

(db) 

(Freq.) 

(Amp.) 

(Freq.) 

(Amp.) 

(Freq.) 

(Amp.) 

35.0 

.OOOOe-0 

.7824 

.1306e-4 

.1021e+l 

.95766-5 

.7812e-2 

30.0 

.1574e-4 

.1280€+1 

.2553e-4 

.1436e-l-l 

.1703e-4 

.7812e-2 

25.0 

.9000e-5 

.2510e^l 

.4685e-4 

.2619e+l 

.3028e4 

.7812e-2 

20.0 

.2623e-4 

.4352e+l 

.9350e-4 

.4442e-M 

.53S5e-4 

.78l2e-2 

15.0 

.4244e-4 

.5743e-l-l 

.l786e-3 

.57746+1 

.9576e-4 

.7812e-2 

10.0 

.8106e-5 

.4032e-(-l 

.6325e-l 

.4057e+l 

.1703e-3 

.7812e-2 

5.0 

.8035e-3 

.1996e+l 

.1414 

.2108e+l 

.3028e-3 

.78l2e-2 

0.0 

.1795e-2 

.8555 

.1789 

.1231 

.5385e-3 

.7812e-2 

-5.0 

.4144e-2 

.7965 

.1838 

.1074e+l 

.9576e-3 

.7812e-2 

-10.0 

.5852e-l 

.7387 

.2174 

.9565 

.l703e-2 

.7812e-2 

e)  Prony’s 

Method 

SNR 

Bias 

Bias 

Std.  of 
Mse 

Std.  of 
Mse 

C.R.B. 

C.R.B 

(db) 

(Freq.) 

(Amp.) 

(Freq.) 

(Amp.) 

(Freq.) 

(Amp.) 

35.0 

.1953e-3 

.6653e-2 

.1953e-3 

.1259 

.95766-5 

.78126-2 

30.0 

.1953e-3 

.3972e-l 

.1953e-3 

.1177 

.17036-4 

.78126-2 

25.0 

.1953e-3 

.63l7e-2 

.1953e-3 

.1047 

.30286-4 

.7812e-2 

20.0 

.1953e-3 

.3010e-l 

.1953e-3 

.9838e-l 

.53856-4 

.7812e-2 

15.0 

.1953e-3 

.1869e-l 

.1953e-3 

.1013 

.9576e-4 

.78126-2 

10.0 

,9763e-4 

.3285e-l 

.3088e-3 

.9504e-l 

.1703e-3 
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f)  The  SVD  Method 


Figure  4..1  (Con’t.) 
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To  test  the  consistency  of  the  performance  for  these  six  methods  the  simulation  is 
repeated  for  100  independent  trials.  If  the  frequency  bias  is  larger  than  a  frequency  bin 
or  if  the  estimated  amplitude  falls  outside  3  DB  of  the  expected  amplitude,  the  estimation 
run  is  regarded  as  an  error.  We  plot  in  Figure  4.3  the  estimation  error  as  a  function 
of  SNR  and  frequency  for  all  six  methods.  The  size  of  the  circle  in  the  plots  indicate 
the  percentage  of  error  at  that  frequency  for  that  SNR.  The  results  shown  in  Figure  4.3 
confirms  our  observations  disciissed  previously. 

We  are  interested  in  comparing  the  mean  squared  error  for  frequency  estimation  of  all 
the  methods  to  the  Cramer  Rao  bound.  This  is  shown  in  Figure  4.4.  It  is  readily  observed 
from  Figure  4.4  that  the  SVD  method  performs  well  particularly  in  low  signal  to  noise 
ratio  conditions. 

Test  Two:  Two  Sintisoids 

In  the  second  test  case,  we  use  two  sin\isoids,  eEu:h  with  its  frequency  sweeping  from 
low  frequency  to  high  frequency.  The  results  for  these  simulations  are  shown  in  Figures 
4.5  and  4.7.  We  plot  the  percentage  of  error  as  a  function  of  the  two  varying  frequencies 
for  SNR  =  25  DB  in  Figme  4.5,  for  SNR  =  10  DB  in  Figure  4.6,  and  for  SNR  =  5  DB  in 
Figure  4.7.  Examination  of  these  three  figures  shows  that  the  performance  of  frequency 
resolution  tends  to  degp-ade  as  SNR  decreases. 

For  the  sake  of  comparison,  we  have  selected  a  single  case  with  frequencies  0.2  and 
0.21.  These  values  are  chosen  in  such  a  way  that  the  frequency  0.2  is  right  on  the  frequency 
bin  while  0.21  is  not.  Again,  the  mean  squared  error  of  the  frequency  estimation  is  mea- 
Bxired  and  compared  to  the  Cramer  Rao  bound.  The  plot  for  f  =  0.20  is  shown  in  Figure  4.8 
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Figure  4.3  Plots  of  estimation  error  percentage  as  a  function  of  SNR  and  frequency  for  all 
methods 


six 
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SNR 


Figure  4.4  Comparison  of  the  mean  squared  error  of  all  the  algorithms  to  the  Cramer  Rao  Bound 
for  the  single  sinusoid  (f  =  .20}  case 
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Figure  4.6  Plots  of  estimation  error  percentage  for  all  six  methods  at  SNR  =  10  DB 
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Figure  4.7  Plots  of  estimation  error  percentage  for  all  six  methods  at  SNR  =  5  DB 
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while  that  for  f  =  0.21  is  shown  in  Figure  4.9.  Note  that  when  the  frequency  is  on  the 
bin,  rather  accurate  estimation  can  be  obtained.  When  the  frequency  is  off  the  bin,  poorer 
estimation  is  observed. 
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1  Introduction 

There  is  increased  interest  in  implementing  artificial  intelligence  applications 
in  Ada.  Efforts  in  this  area  encompass  expert  systems  [Adk86,LaV86,JLZ87|, 
distributed  knowledge  based  systems  [Bra86,Fra86],  pattern  directed  pro¬ 
cessing  [RKW85],  semantic  networks  [SPA86|,  reusable  heuristic  search  algo¬ 
rithms  [Dob88]  and  others.  This  paper  concentrates  on  the  implementation  of 
blackboard  (BB)  systems  in  Ada.  It  describes  the  blackboard  control  model 
of  problem  solving,  a  generic  approach  to  the  design  and  implementation  of 
a  blackboard  control  system,  and  the  application  of  the  approach  to  a  classic 
defense  problem. 

2  Blackboard  Model 

A  problem  solving  model  is  a  scheme  that  constructs  a  solution  by  organiz¬ 
ing  reasoning  steps  and  domain  knowledge.  A  model  provides  a  conceptual 
framework  for  organizing  knowledge  and  a  strategy  for  applying  that  knowl¬ 
edge.  Examples  of  problem  solving  models  include  forward  reasoning  models, 
backward  reasoning  models,  event  driven  models,  model  driven  models,  etc. 
In  a  forward  reasoning  model,  the  inference  steps  are  applied  from  2ui  initial 
state  toward  a  goal.  In  a  backward  reasoning  model,  problem  solving  begins 
by  reasoning  backward  from  a  goal  to  be  achieved  toward  an  initial  state. 

The  blackboard  model  [Nii86b,Nii86a],  which  was  developed  in  the  1970’s 
and  has  undergone  very  few  changes  in  the  last  ten  years,  uses  an  opportunis¬ 
tic  reasoning  model.  In  an  opportunistic  reasoning  model,  pieces  of  knowl¬ 
edge  are  applied  either  backward  or  forward  at  the  most  opportune  time.  This 
model  was  first  abstracted  from  the  Hearsay- II  speech  understanding  system 
[Nii82]  and  applied  to  the  design  and  implementation  of  the  HASP  system 
for  ocean  surveillance  [EHRLRSO].  Many  application  programs  have  sub¬ 
sequently  been  implemented  (usually  in  Lisp)  using  the  blackboard  model. 
These  include  systems  for  interpreting  electron-density  maps,  planning  er- 
rainds,  understanding  military  signals,  and  understanding  images. 

The  basic  blackboeu-d  model  is  usually  described  as  consisting  of  three 
major  components  -  the  knowledge  sources,  the  blackboard  data  structure, 
and  the  control  -  as  shown  in  the  figure  on  the  following  page. 
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( - data  flow) 


The  dom<un  knowledge  sources  aie  partitions  formed  from  the  total  do¬ 
main  knowledge  that  can  be  used  to  solve  the  problem.  These  knowledge 
sources  are  logically  independent  and  kept  separate.  The  domain  blackboard 
data  structure  is  a  global  data  base  that  holds  the  problem  solving  state  data. 
The  solution  space  is  orgamized  into  one  or  more  application  dependent  hier¬ 
archies.  Information  at  each  level  of  the  hierarchy  represents  partial  solutions 
and  is  associated  with  a  unique  vocabulary  that  describes  the  information. 
The  knowledge  sources  produce  changes  to  the  blackboard  that  lead  incre¬ 
mentally  to  a  solution  to  a  problem.  Communication  and  interaction  among 
knowledge  sources  take  place  solely  through  the  blackboard.  There  is  no  pre¬ 
determined  flow  of  control.  The  knowledge  sources  respond  opportunistically 
to  changes  in  the  blackboard.  The  knowledge  sources  transform  information 
on  one  level  of  the  hierarchy  into  information  on  the  same  or  other  levels 
using  algorithmic  procedures  or  heuristic  rules  that  generate  actual  or  hy¬ 
pothetical  transformations.  Which  knowledge  source  to  apply  is  determined 
dynamically,  one  step  at  a  time,  resulting  in  the  incremental  generation  of 
partial  solutions.  The  choice  of  a  knowledge  source  is  based  on  the  solution 
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state  and  on  the  existence  of  knowledge  sources  capable  of  improving  the 
current  state  of  the  solution. 

There  are  at  leaist  two  different  approaches  to  handling  the  control.  The 
first  has  control  residing  in  a  set  of  procedures  which  monitor  the  changes  on 
the  domain  blackboard  and  trigger  appropriate  knowledge  sources  to  improve 
the  solution  state.  In  the  second  approach,  control  is  achieved  by  placing  the 
strategy  on  a  control  blackboard.  The  decision  then  as  to  what  to  do  next  is 
made  by  control  knowledge  sources  using  the  control  blackboard  where  data 
describing  the  state  of  the  current  strategy  exists.  Strategies  can  be  enabled 
on  the  control  blackboard  to  reflect  the  current  state  of  the  problem  solu¬ 
tion.  This  second  approach  is  referred  to  as  a  blackboard  control  architecture 
[HR85]. 

The  blackboard  architecture  approach  improves  on  traditional  expert  sys¬ 
tems  for  solving  ill-structured  problems  in  the  following  ways.  First,  the 
blackboard  approach  requires  no  a  priori  determined  reasoning  path.  Because 
ill-structured  problems  often  do  not  have  a  predetermined  decision  path  to  a 
solution,  the  selection  of  what  to  do  next  must  be  made  while  the  problem  is 
being  solved.  The  capability  to  do  this  is  provided  in  blackboard  systems  by 
the  incremental  and  opportunistic  problem  solving  approach.  Second,  vague 
information  and  knowledge,  which  characterize  ill- structured  problems,  need 
to  be  made  concrete  in  the  process  of  finding  a  solution  to  the  problem.  The 
blackboard  model  is  an  excellent  tool  for  this  knowledge  engineering  activity. 
During  the  initial  interactions  with  an  expert,  a  knowledge  engineer  tries  to 
find  an  appropriate  conceptual  model  for  the  task  while  trying  to  understand 
the  domain  and  the  nature  of  the  task.  The  blackboard  approach  aids  in  the 
problem  formulation  because  it  provides  some  organizational  principles  that 
are  both  powerful  and  flexible.  The  blackboard  approach  is  also  an  excellent 
tool  for  exploratory  programming,  a  useful  technique  for  developing  solutions 
to  complex  and  ill-structured  problems. 

Although  blackboard  systems  are  useful  for  many  complex,  ill-structured 
problems,  they  are  generally  expensive  to  build  and  to  use.  Therefore,  the 
blackboard  approach  should  not  be  used  when  lower  cost  methods  are  suffi¬ 
cient.  A  problem  which  exhibits  some  combination  of  the  following  charac¬ 
teristics  is  a  good  candidate  for  the  blackboard  approach; 

1.  a  large  solution  space 

2.  noisy  and  unreliable  information 
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3.  a  variety  of  input  data  and  a  need  to  integrate  diverse  information 

4.  the  need  for  many  independent  or  semi-independent  pieces  of  knowledge 
to  cooperate  in  forming  a  solution 

5.  the  need  to  use  multiple  reasoning  methods  or  lines  of  reasoning 

6.  the  need  for  an  evolutionary  solution 

A  proposed  application  should  be  carefully  amalyzed  before  a  decision  to 
implement  a  blackboard  system  is  made. 

3  Application  of  BB  Systems 

The  following  sections  describe  the  domain  analysis,  functional  requirements, 
design,  and  implementation  of  the  prototype  system  for  a  classic  defense 
problem. 

3.1  Domain  Analysis 

Functions  of  a  classic  defense  system  include  sensing  of  the  environment; 
interpretation  of  conflicting,  incomplete  or  corrupted  sensory  data;  integra¬ 
tion  of  sensory  data;  overall  situation  assessment;  planning  eind  real-time 
decisions  for  mission  and  survival  achievement;  and  control  of  weapon  sys¬ 
tem  functions.  These  activities,  which  axe  currently  performed  by  the  crew, 
would  ideally  be  provided  by  a  ground  and/or  onboard  computer  system. 
The  system  would  be  able  to  function  in  an  environment  containing  vast 
amounts  of  raw  data  where  the  data  may  be  unintentionally  corrupted  via 
natural  phenomena  or  intentionally  corrupted  via  friendly  or  enemy  jamming 
and  deception  missions. 

Electronic  warfare  (EW)  processing  spans  many  different  disciplines  (pas¬ 
sive  radio  frequency  (RF)  and  infrared  (IR)  sensor  interpretation,  active  RF 
and  IR  countermeasures  techniques,  etc.)  where  the  sensors  operate  in  dif¬ 
ferent  environments  with  different  requirements  for  information  extraction, 
interpretation  and  reaction.  The  EW  system  must  respond  to  a  dense  and 
dynamic  environment.  A  priori  information  is  used  to  distinguish  threats  to 
the  aircraft  from  non-threats  and  also  contributes  to  the  system  response  and 
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resotirce  allocation  strategy.  A  priori  information  often  does  not  represent 
the  true  state  of  the  environment  because  of  environment  noise,  intentional 
deceptive  emissions,  jamming,  etc. 

From  this  description  it  is  easy  to  see  that  EW  exhibits  several  of  the 
criteria  that  make  a  good  candidate  for  a  blackboard  system.  It  has  a  large 
solution  space  that  includes  knowledge  concerning  static  threats,  limited  re¬ 
sources,  passive  sensors,  terrain  data,  platform  data,  active  countermeasures, 
goals  to  be  accomplished,  etc.  Some  of  the  data  is  noisy  and  unreliable.  There 
is  a  variety  of  input  data  and  a  need  to  integrate  diverse  information.  There 
is  also  the  need  for  many  independent  pieces  of  knowledge  to  cooperate  in 
forming  a  solution.  The  different  types  of  knowledge  require  different  vocab¬ 
ularies  and  different  lines  of  reasoning.  The  solution  evolves  as  new  pieces  of 
information  are  sensed,  input  and  derived  from  existing  data. 

One  EW  defense  problem  is  the  monitoring  of  a  hostile  environment  by 
a  moving  platform  for  the  purpose  of  determining  the  platform’s  best  path 
through  the  environment.  Emissions  are  detected  and  locations  and  types  of 
emitters  are  determined.  A  map  is  produced  which  represents  a  snapshot  of 
the  current  knowledge  about  the  environment.  This  map  can  then  be  used  as 
input  to  a  program  that  will  determine  the  best  path  for  the  platform  through 
this  environment  [DDL88].  This  monitoring  problem  was  chosen  for  the 
prototype  of  the  blackboard  control  system  in  Ada.  The  domain  blackboard 
will  contain  the  current  situation  board  with  respect  to  the  environment 
while  the  control  blackboard  will  contain  data  that  indicates  strategies  and 
reasonings  to  be  employed. 

3.2  Functional  Requirements 

The  functional  requirements  for  the  prototype  include: 

1.  Process  batches  of  input  data  at  regular  intervals  of  4  time  units.  In¬ 
put  data  can  be  either  signal  data  or  intelligence  reports.  Signal  data 
consists  of  three  emitter  characteristics  -  frequency,  pulse  width  and 
pulse  repetition  frequency;  a  location  represented  by  a  position  on  a 
grid;  and  a  signal  detection  time.  Intelligence  report  data,  which  is  for 
threats  only,  consists  of  the  location  and  sighting  report  time. 

2.  Determine  type  of  emitter  from  characteristics  of  each  set  of  signal 
data. 
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3.  Identify  each  emitter  as  threat  or  nonthreat  based  on  the  capabilities 
of  the  emitter  type. 

4.  Post  to  the  current  situation  board  (CSB)  the  locations  of  known 
threats  and  nonthreats  and  the  time  of  sighting. 

5.  Eliminate  duplicate  sightings  but  keep  history  information  for  each 
location. 

6.  Keep  a  list  of  known  friendly  locations  to  determine  whether  any  emit¬ 
ter  identified  as  a  threat  is  a  known  friend. 

7.  Output  CSB  at  regular  intervals  of  10  time  units,  including  location  of 
sighting,  threat  or  nonthreat,  history  information,  and  confidence. 

8.  Output  SOS  message  immediately  when  possible  threat  is  detected. 

3.3  Design 

Two  different  design  methods  were  considered  for  the  system  -  object  ori¬ 
ented  and  functional.  The  choice  of  functional  design  was  based  on  the  fact 
that  blackboard  systems  and  the  blackboard  control  architecture  have  been 
described  functionally  [HR85].  As  the  system  is  decomposed,  the  functional 
design  approach  identifies  major  tasks  to  be  performed.  Subprograms  (pro¬ 
gram  units)  become  the  building  blocks  of  a  functional  design.  The  system  is 
described  in  terms  of  the  functions  that  process  the  data.  Functions  receive 
input,  transform  input,  and  produce  output.  The  functional  requirements 
define  what  needs  to  be  done.  The  functional  design  includes  the  require¬ 
ments  as  well  as  the  data  flow.  Pictured  below  are  the  various  levels  of  a 
functional  design. 

At  the  highest  level,  a  block  diagram  of  the  system  showing  inputs  and 
outputs  would  look  like  this: 
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3.4  Implementation 

The  blackboard  control  architecture  model  defines  two  blackbocirds,  one  for 
the  domain  and  one  for  control.  A  set  of  data  structures  and  a  set  of  knowl¬ 
edge  sources  are  associated  with  each  blackboard.  The  data  structure  for 
each  blackboard  has  its  own  hierarchy.  Each  of  these  components  is  defined 
below  for  the  monitoring  problem. 

3.4.1  Domain  Blackboard 

The  hierarchy  for  the  domain  blackboard  contains  three  levels  of  abstraction: 
signals,  emitters  and  sites.  The  domain  knowledge  sources  are  listed  below: 

•  KSO.  Initializes  table  of  locations  of  known  friends  on  domain  black¬ 
board. 
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•  KSl.  Reads  input  data,  creates  signal  node  on  signal  level  of  domain 
blackboard  or  creates  expectation  on  site  level  of  domain  blackboard. 

•  KS2.  Generates  emitter  node  on  emitter  level  of  domain  blackboard 
based  on  characteristics  of  signal  node. 

•  KS3.  Generates  site  node  on  site  level  of  domain  blackboard  based 
on  characteristics  of  emitter  node  zmd  other  information  on  domain 
blackboard. 

•  KS5.  Handles  duplicate  emitter  nodes  on  emitter  level  of  domain 
blackboard,  updating  history  information. 

•  KS6.  Initiates  termination  of  system. 

•  KS7.  Prints  SOS  message  when  possible  threat  is  sighted. 

•  KS8.  Outputs  CSB  and  associated  information. 

Execution  of  the  knowledge  sources  reflects  changes  in  the  environment. 
Both  forward  and  backward  reasoning  are  used  on  the  domain  blackboard. 
Forward  reasoning  takes  signal  input  data,  creates  emitter  types  from  the 
signal  data,  and  generates  threat  or  nonthreat  sites  from  the  emitter  types. 
Backward  or  model  driven  reasoning  occurs  when  intelligence  reports  indicate 
a  threat  at  a  specific  location  at  a  specific  time.  The  site  Ccin  then  be  verified 
based  on  the  intelligence  report.  Discrepancies  at  this  level  result  from  noisy 
or  unreliable  data. 

Updating  the  domain  blackboard  creates  events  that  reflect  the  updated 
environment.  These  events  are  recorded  in  knowledge  source  activation 
records  (KSARs). 

3.4.2  Control  Blackboard 

The  control  blackboard  contains  three  levels  of  abstraction  that  define  the 
strategy  used  for  handling  events  that  occur  in  the  system.  These  levels 
are  policy,  tactic  and  step.  Policy,  located  on  the  highest  level,  determines 
which  blackboard  is  effected  -  domain  or  control.  Tactic,  on  the  next  level, 
specifies  the  category  of  event  to  process.  The  lowest  level  of  abstraction, 
step,  differentiates  between  the  two  low  level  event  types. 
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The  control  knowledge  sources  toggle  between  possible  values  on  each 
level  of  abstraction  to  implement  the  chosen  strategy. 

•  KSIO.  Initializes  the  system  with  problem  description  from  user. 

•  KSll.  Initializes  control  blackboard  by  setting  up  policy,  tactic  and 
step  level  for  policy  being  used. 

•  KS12.  Generates  event  to  change  policy  if  policy  is  not  “control”. 

•  KS13.  Toggles  tactic  setting. 

•  KS14.  Toggles  step  setting. 

•  KS20.  Toggles  policy  setting. 

Three  additional  domain-independent  procedures  -  update  trigger  list, 
choose  KSAR,  and  execute  KSAR  -  drive  an  iterative  control  strat¬ 
egy  that  operates  around  three  event  lists  -  action,  trigger  and  invocable. 
Update  trigger  list  moves  KSARs  from  the  action  list  to  the  trigger  list. 
Choose  KSAR  matches  conditions  of  control  and  the  environment  with  the 
KSAR  and  moves  the  KSAR  to  the  invocable  list  if  the  conditions  are  met. 
Execute  KSAR  executes  the  knowledge  source  of  the  first  KSAR  on  the 
invocable  list. 

3.4.3  Operation 

A  KSAR  contains  information  on  the  triggering  cycle,  triggering  event,  pre¬ 
condition  values,  condition  values,  trigger  weight,  knowledge  source  impor¬ 
tance,  event  type,  rating  and  priority.  The  KSAR  is  initially  placed  on  the 
action  event  list.  The  action  event  list  contains  all  KSARs  generated  by  the 
previous  cycle’s  execution.  As  KSARs  on  the  action  event  list  are  moved  to 
the  trigger  event  list,  the  events  are  given  a  rating  based  on  the  current  state 
of  the  environment  and  the  importzmce  of  the  associated  knowledge  source. 
The  KSARs  are  placed  on  the  trigger  event  list  in  descending  order  by  rating 
points.  KSARs  axe  moved  to  the  invocable  event  list  based  on  the  current 
control  strategy.  The  knowledge  source  of  the  first  KSAR  on  the  invocable 
list  is  executed.  If  no  knowledge  source  is  triggered  on  a  particular  cycle,  cin 
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event  is  created  that  will  enable  a  change  in  policy,  tactic  or  step  level  on 
the  control  blackboard. 

The  action,  trigger  and  invocable  event  lists  contain  both  domain  cmd 
control  events.  The  policy,  tactic  and  step  settings  determine  which  KSARs 
get  moved  to  the  invocable  list.  Only  one  event  activates  a  knowledge  source 
(KS)  each  cycle  -  the  event  that  is  first  on  the  invocable  list.  The  execution 
of  a  knowledge  source  on  a  cycle  generates  new  KSARs  on  the  action  list.  A 
general  diagram  of  the  system  is  shown  below. 
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4  Use  of  Ada 


The  monitoring  system  was  developed  in  Vax  Ada  on  a  Vax  11/780  running 
the  VMS  operating  system.  The  use  of  Ada  for  this  prototype  presented  no 
major  problems.  The  resulting  system  has  the  structure  shown  below.  (The 
arrows  indicate  required  visibility.) 


Several  of  Ada’s  features  greatly  facilitated  the  implementation.  The 
package  concept  facilitates  data  encapsulation,  modularity,  and  locality.  Op¬ 
erations  on  a  particular  object,  for  example  an  event,  are  contained  in  a 
package  with  the  implementation  details  hidden  from  users  of  the  package. 
Changes  in  the  implementation  will  not  effect  the  control  structure  or  the 
object’s  interaction  with  other  objects.  The  modularity  feature  of  the  pack¬ 
age  encourages  reusability  as  basic  operations  can  be  reused  and  not  repro¬ 
grammed.  Packages  were  used  to  contain: 
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•  blackboards  with  associated  procedures  (data  and  utilities  package) 


•  procedures  to  execute  cycle  for  control  (control) 

•  knowledge  sources  (KSs) 

The  data  and  utilities  package  contains  aJl  global  structures  and  utilities. 
The  control  and  domain  blackboards  are  defined  in  this  package  along  with 
procedures  that  delete  information  from  the  domain  blackboard  as  well  as 
create,  add  and  delete  nodes  from  the  action,  trigger,  and  invocable  event 
lists.  Much  of  this  package  is  reusable  since  it  manipulates  the  structures  for 
a  blackboard  control  architecture. 

Each  knowledge  source  is  contained  in  a  separate  package.  New  knowledge 
sources  can  be  added  and  existing  ones  changed  without  recompiling  the 
entire  system.  (Instead  of  packages,  knowledge  sources  could  be  contained  in 
separately  compilable  procedure  modules.)  Keeping  these  knowledge  sources 
separate  will  also  facilitate  the  introduction  of  concurrency  into  the  system 
using  the  tasking  feature.  More  than  one  knowledge  source  could  then  be 
executed  in  a  cycle. 

Other  Ada  features  were  also  useful  in  the  development  of  this  system. 
Variant  records  were  used  to  represent  blackboard  nodes  since  the  nodes  on 
the  different  levels  of  the  domain  blackboeud  contained  different  information 
fields.  Using  vairiant  records  made  it  possible  to  manipulate  the  nodes  on  all 
levels  with  the  same  procedures. 

The  one  disadvantage  that  we  encountered  resulted  from  change  -  in  the 
structure  or  information  that  was  on  one  of  the  blackboards.  Any  changes  in 
the  data  and  utilities  package  made  it  necessary  to  recompile  all  other  units. 
Changing  the  tables  from  static  to  dynamic  and  initializing  them  from  files 
would  eliminate  many  of  the  changes  that  were  required. 


5  Conclusions 

A  prototype  monitoring  system  based  on  the  blackboard  control  model  of 
problem  solving  was  successfully  implemented  in  Ada.  Blackboard  systems 
exhibit  several  characteristics  that  parallel  features  of  Ada.  The  indepen¬ 
dent  knowledge  sources  can  be  developed  as  program  units.  Packages  can 
encapsulate  operations  on  specific  types  of  data.  The  tasking  mechanism  can 
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provide  the  dynamic  property  required  by  the  control  mechanism.  The  use  of 
Ada  tasks  will  zJlow  a  natural  movement  to  multiprocessor  systems.  These 
features,  along  with  the  capability  to  incrementally  build  a  solution,  make 
Ada  a  good  choice  for  a  blackboard  system. 

The  incorporation  of  AI  techniques  into  solutions  of  real  world  problems 
has  been  hampered  by  the  use  of  speciality  languages  for  their  implementa¬ 
tion.  Ada  provides  the  vehicle  for  moving  these  techniques  out  of  the  research 
laboratories.  Successful  integration  of  artificial  intelligence  techniques  into 
Ada  will  create  adaptable  software  packages  that  are  highly  suitable  for  solv¬ 
ing  real  world  problems. 
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Abstract 


The  electronic  transport  properties  (electrical  conductivity, 
mobility  and  carrier  concentration)  have  been  measured  in  undoped, 
SI-GaAs  wafers  which  have  been  subjected  to  various  surface 
treatments.  Several  samples  were  selected  and  measurements  have  been 
carried  out  in  the  temperature  range  from  ?50  to  AOO^K  with  gas  ambients 
Np,  He  air  and  vacuum.  Tt  has  been  observed  that  removal  of  several 
microns,  before  the  measurements,  and  the  choice  of  ambients  during  the 
measurements,  both  can  have  a  substantial  effect  on  the  apparent 
conductivity,  concentration  and  mobility.  It  is  concluded  that  results 
are  related  to  the  conduction  lattice  damage  (subsurface  damage)  and 
surface  states  (conductive  oxide  layers)  on  the  surface. 
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SURFACE  STATES  and  ELECTRON  TRANSPORT  PROPERTIES  IN 
SEMI -INSULATING  GaAs 


I.  Introduction 


The  purpose  of  this  research  was  to  continue  and  extend  the  study ^ 
of  the  surface  states  in  semi-insulating  (SI)  GaAs  that  was  fi) 
initiated  at  the  Avionics  Laboratory,  Wright  Patterson  Air  Force  Base, 
in  May-July,  1987  under  USAF-UES  Summer  faculty  Research  program,  and 
(ii)  subsequently  funded  by  UFS/AFOSR  under  Research  Initiation  Program. 
SI-GaAs  available  from  Avionics  Laboratory,  which  were  grown  by  Texas 
Instruments  and  Rockwell  International,  were  used.  In  particular,  we 
have  selected  random  wafers  and  wafer  sections  and  measured  the  Hall- 
the  Hall -properties,  and  dc-conductivities  in  the  temperature  range  ?50 
to  400^K.  For  ac-properties ,  we  have  developed  a  system  suitable  for 
the  temperature  range  77  to  400^K  and  frequency  range  10  kHz  to  40  MHz. 
From  these  measurements,  the  effect  of  the  surface  states  have  been 
investigated  in  various  gas  ambients  IN^,  He,  air  and  vacuum).  The 
results  have  been  interpreted  in  terms  of  recently  predicted  theories 
for  SI-GaAs. 

1 1 .  Research  Work  and  Technical  Approach 

The  substrate  material  for  many  high  performance  GaAs  device  is 

undoped  SI-GaAs.  It's  usefulness  and  practical  importance  are  well 

known.  Many  studies  and  results  on  SI-GaAs  have  been  reported  and  many 

problems  have  been  worked  out.  However,  there  remain  some  problems, 

such  as,  the  effect  of  nonuniformity  a  major  defect  (EL?)  and  its 

distribution  on  the  electronic  transport  properties.  The  correlations 

are  still  not  clearlv  understood  and  hence  all  necessary  information 

on  this  issue  is  not  available.  When  SI-GaAs  is  used  as  substrate 

o 

material  for  devices,  after  MBE  layers  are  grown,  it  is  believed' 


61-2 


characteristics  of  the  devices  will  heavily  depend  on  the  properties  of 
the  substrate.  The  surface  states  or  defects  could  arise  in  the  samples 
from  absorbed  or  adsorbed  qas  molecules.  The  ionized  molecules,  in  the 
presence  of  electric  field,  could  thermally  diffuse  into  the  layers 
immediately  below  the  surface  and  might  interfere  with  the  conduction 
band  electrons.  Yeo  et  al  .  have  investigated  the  surface  effect  on 
Si-implanted  GaAs  (n-type)  and  concluded  that  these  surface  states  could 
also  result  from  the  abrupt  termination  of  the  lattice  or  defects  at  the 
semiconductor  surface,  and  therefore,  they  can  pin  the  Fermi  energy 
within  the  forbidden  gap  at  the  surface,  creating  a  carrier  depletion  in 
the  region  just  below  the  surface.  The  long  range  potential  fluctuation 
is  a  real  possibility  due  to  a  nonuniform  distribution  of  the  impurities. 

4-8 

Many  studies  of  the  conductivity  on  SI-GaAs  indicate  similar 
interesting  results.  For  example,  the  ac-conducti vity  results  show  that 
the  conductivity  is  frequency  dependent  even  at  high  temperatures. 
Normally,  amorphous  semiconductors  and  dielectrics  show  a  strong 
frequency  dependent  conductivity,  but  crystalline  semiconductors  show  • 
somewhat  smaller  effect  at  relatively  low  temperatures.  On  the  other 
hand,  dc-conducti vity  measurements  have  been  explained  in  terms  of 
quantum  mechanical  tunneling  between  localized  gap  states  in  the 
bandgap.  These  states  are  separated  by  potential  barriers  which  are 
found  to  fluctuate  with  the  thermal  vibration  of  the  lattice.  Other 

4 

studies  indicate  that  the  temperature  dependence  of  the 
dc-conducti vity,  especially  in  the  low  temperature  region,  is  given  by 
the  Berthelot  equation.  For  the  ac-conducti vity,  as  well,  there  are  some 
recent  developments^.  Abdalla  and  Pistoulet^  proposed  a  model  based  on 
Gaussian  distribution  of  potential  fluctuation  suggesting  aau;^  where 
s  <1  for  the  intermediate  range  of  (»)<10^  Hz,  but  s  =  2  form  >  lO^Hz  under 
some  approximations.  Hqwever,  their  model  depends  on  a  number  of 
unknown  input  parameters  whose  justification  may  be  questionable. 

In  this  research  we  have  investigated  surface  states  effect  on 
several  SI-GaAs  samples.  These  are  illustrated  in  Table  I.  These 
samples  are  available  from  WPAFB,  and  have  been  studied  very  extensively. 
More  detailed  description  of  sample  growth  and  characterization  are 
available  elsewhere. 
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The  Tl-satnples  (Texas  Instruments)  are  low-pressure,  liquid-encapsulated 

Czochralski  (LEC),  near-tail-end  of  the  boule,  which  are  undnppd,  but  R- 

samples  (Rockwell)  are  a  high  pressure  LEC,  near-seed-end,  and  doped 

with  In  (?.3xl0  /cm"  in  the  melt).  For  this  reason,  these  sets  are 

contrasting.  The  methods  of  sample  preparation,  polishina  and  etchinq, 

9  10 

etc.  are  also  described  in  the  above  two  reports.  ’ 

Our  efforts  have  been  directed  in  two  directions:  (i)  investigation 
of  several  samples,  both  of  different  wafer  numbers  and  different  wafer 
sections,  and  the  (ii)  measurement  of  dc-conducti vities  and  Hall 
parameters,  especially  in  the  temperature  range  250  to  400°K. 

Ill  Experimental 

In  this  research  we  have  included,  as  shown  in  Table  I,  two  types 
of  SI-GaAs  samples  (Texas  Instruments  and  Rockwell  International).  The 
last  number  in  each  sample  indicates  the  location  on  the  wafer  section 
(64  and  65  in  the  present  case) 

The  Hal  1 -properties  and  the  dc-conductivities  were  measured  at 
Avionics  Laboratory,  WPAFB  in  collaboration  with  D.C.  Look.  The 
experimental  setup  was  a  standard  van  der  Pauw  configuration  and  is 
shown  in  Fig.  1.  For  ac-properties ,  we  have  developed  a  compatible 
system  here  at  HSF,  which  is  shown  in  Fig.  2.  As  is  well  known  the 
resistivity  of  these  samples  were  quite  high  (10  -x-lO  ohm. cm).  This 
required  an  impedance  measurement  of  about  500^  800  Mn,  which  was  far 
beyond  the  limit  of  our  present  capability  of  the  HP4275A  LCR-met^r 
(limit  being  20  Mp  ).  Hence  the  present  setup  could  not  be  used  for 
SI-GaAs  samples.  We  shall  discuss  later  on  what  improvements  would  have 
to  be  made  in  the  future  to  overcome  these  limitations.  The  results  are 
shown  in  figs.  3-9. 
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IV,  Results  and  Pi scussions 

The  data  from  de-measurements  indicate  that  the  resistivity  of  the 

8  6  3 

samples  is  quite  high  10  cm,  concentration  10  /cm  and  mobility  u 
3  2 

10  cm  V-s.  The  conductivity  a  is  linear  with  1/T  when  plotted  on 
logarithmic  scale  in  the  high  temperature  region,  but  there  is  some 
nonlinearity  in  the  low  temperature  region,  particularly  below  room 
temperature.  The  slopes  of  the  logo  vs  1/T  plots  are  different  for 
different  gas  ambients,  especially  below  room  temperature.  The 
behavior  of  the  concentration  plots,  i.e.,  log  n  vs  1/T  is  the  same  as 
that  of  log  a  vs  1/T  plots,  since  a  is  not  nearly  as 
temperature-dependent  as  n.  The  mobility  y,  as  expected,  increased 
with  1/T  in  the  high  temperature  region.  The  data  from  ac-measurements 
are  not  available  for  these  samples  as  yet,  since  the  limit  of  our 
equipment  was  20  m  fi,  whereas  resistance  in  these  samples  ranged  between 
100  to  800  Mfl.  We  are,  therefore,  looking  into  alternative 
possibilities,  which  will  be  implemented  into  the  second  phase  of  our 
research. 

c 

Kristofik  et  aV.  measured  the  temperature  and  frequency  dependence 
of  electrical  conductivity  of  SI-6aAs  and  concluded  that  the  surface 
conductivity  is  explained  by  the  Berthelot  equation,  whereas  the  bulk 
conductivity  has  the  Arrhenius-type  behavior.  This  type  of 
investigation  was  followed  by  Abdalla  and  Pistoulet^,  and  Pistoulet  and 
Hamandjian^.  They  suggested  that  the  long  range  potential  fluctuations 
can  give  rise  to  mixed  conductivity  due  to  the  accumulation  of  electrons 
in  potential  wells  separated  in  space  from  holes  accumulated  in 
potential  hills.  They  have  further  carried  out  a  theoretical  analysis 
of  the  problem  by  assuming  a  potential  fluctuation  with  truncated 
Gaussian  distributions,  which  depended  on  a  number  of  input  parameters 
(as  many  as  eight  parameters).  On  the  other  hand,  Look^  had  already 
pointed  out  earlier  the  existence  of  mixed  conduction  in  Cr-doped 
SI-6aAs  from  his  study  of  the  Hall  effect  and  magnetoresistance.  Later 

s' 

on,  he  has  developed  an  appropriate  theory  to  represent  the  mixpd 

2 

conduction  effect  in  semiconductors  in  general.  Lo  et  al  .  have 
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demonstrated  by  a  surface  property  study  using  cw-electrc-optic  probing 
that  the  EL2  concentration  on  the  surface  of  SI-6aAs  is  nonuniform 
across  the  wafer,  which  tends  to  be  depleted  near  the  surface  after 
thermal  annealing.  This  surface  depletion  of  EL2  defect  can  cause  a 
very  leaky  substrate  surface  promoting  a  serious  backgate  effect.  Very 
recently  Lester  et  a.^^  have  reported  studies  on  ambient-induced  surface 
effects  in  GaAs,  demonstrating  sensitivity  of  the  gas  environment  on  the 
conductivity  and  photolumiescence.  Kuhnel  et  al .  '  found  similar 
effects  on  their  data  of  Hall  properties. 

The  plots  shown  in  Figs.  3-9  show  marked  effects  not  only  from 
sample  to  sample,  but  also  from  ambient  to  ambient.  The  conductivity 
and  concentration  data  show  a  gradual,  but  smooth,  variation  with 
temperature,  even  in  the  region  of  low  temperature.  The  mobility  data, 
however,  are  very  sensitive  in  this  region  as  is  evident  from  Figs. 

5-9.  The  effect  of  gas  molecules  apparently  do  enhance  the  surface 
effect  more  in  mobility  than  in  the  conductivity  or  concentration  in 
this  region. 

The  direction  of  the  conductivity  changes  indicate  that  the 
surface  charge  becomes  more  negative  during  exposure  to  air  increasing 
in  positive  space  charge  and  moves  the  surface-Fermi  level.  Lester  et 
al.^^  have  investigated  the  ambient  induced  movement  of  the  surface-Fermi 
level  in  InP  and  GaAs.  In  fact,  they  have  estimated  from 
photoluminescence  data  that  the  Fermi  level  at  0.8  eV  below  the 
conduction  band  in  GaAs.  Our  concentration  data  further  supports  this 
conclusion. 

Kuhnel  et  al.^^  looked  at  Hall  effect  and  conductivity  measurements 

in  SI-GaAs  and  suggested  a  correction  procedure  for  the  surface  effects 

13 

in  their  samples.  However,  Johnson  et  al.  found  a  nonuniform 
distribution  of  charging  effect  by  electron  beam  in  SI-GaAs,  which  they 
attributed  to  the  surface  effect.  This  has  been  interpreted  as  the 
interaction  between  the  propagating  domains  (presumably  the  gas 
ambients)  and  the  cellular  dislocation  structure  formation  in  SI-GaAs. 
These  interactions  seemingly  produce  noise  in  the  current  oscillation 
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waveforms.  This  has  been  also  interpreted  as  the  fluctuation  of  the 

5  6 

long  range  potentials  by  others  ’  . 

Our  mobility  data  apparently  do  not  indicate  any  systematic  pattern 

in  the  low  temperature  region  except  that  they  are  extremely  sensitive 

to  the  formation  of  surface  states.  Another  important  consideration 

should  be  made  with  regard  to  the  subsurface  damage.  It  has  been 

14 

recently  shown  by  Ives  and  Leung  that  a  subsurface  damage  on  GaAs 
wafers  (apparently  SI)  could  affect  the  reliability  and  performance  of  a 
device  fabricated  on  such  wafers.  We  are  at  present  attempting  to  go 
still  lower  in  temperature  and  establish  a  definite  trend  with  various 
gas  ambients.  The  frequency  dependant  data,  which  are  not  available  as 
of  this  moment  in  our  laboratory,  will  further  help  to  explain  this 
situation. 

V.  Conclusions 

Temperature  dependent  Hall  effect  measurements  have  been  carried  out 
on  undoped,  SI-GaAs  wafers,  which  have  been  subjected  to  various  surface 
treatments.  It  is  found  that  removal  of  several  microns,  before  the 
measurements,  and  the  choice  of  ambients  (H^,  He,  air  and  vacuum)  during 
the  measurements,  both  can  have  a  substantial  effect  on  the  apparent 
mobility  and  carrier  concentration.  Results  are  related  to  conduction 
lattice  damage  in  the  near-surface  region,  and  conductive  oxide  states 
on  the  surface. 
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Table  I:  Samples  Studied 


Lab  Code 

Texas  Inst.  Samples 

Lab  Code 

Rockwell  Samples 

AA 

TI 

31-41-65 

CC 

R158-6-65 

FF 

TI 

31-10-64 

HH 

R158-6-64 

GG 

TI 

31-41-64 

II 

R158-33-64 

EE 

TI 

31-10-65 

i 

i 
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Von  der  Pouw 
contig 


Figure  1 


Schematic  diagram  of  the 
effect  apparatus.  A  standard 
shown  in  the  diagram. 


automatic,  high  resistivity  Hall- 
van  der  Pauw  configuration  is  also 
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Ex.  Giifi 


S 

C 


Shi 

91C 

Analojg  Output 


Figure  2 

Schematic  diagram  of  the  automatic  SST-System  for  the  ac-properties . 
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MOBILITY  *10  icm  /v.cmj  CONC. 


MOBILITY 
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Figure  7 .  Sample  GG 
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Figure  8.  Sample  HH 
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ABSTRACT 


This  report  describes  reasearch  performed  under  a  mini-grant  resulting  from  the 
1987  Summer  Faculty  Research  Program  sponsored  by  the  Air  Force  Office  of  Sci¬ 
entific  Research.  Previously  a  preliminary  characterization  was  performed  for  new 
luminescent  rebroadcasting  materials  developed  with  support  from  Wright  Patterson 
Air  Force  Base.  These  devices  axe  significant  because  they  have  optical  disk  resolu¬ 
tion  and  potentially  nanosecond  write-read  speed.  Basic  processing  functions  were 
also  demonstrated.  Experiments  performed  in  this  project  show  for  the  first  time  the 
feasibility  of  using  luminescent  rebroadcasting  devices  for  implementing  three  forms 
of  associative  memory;  multiple  correlations,  symbolic  substitution,  and  neural  net¬ 
works.  Comparisons  between  the  approaches  and  their  advantages  and  disadvantages 
are  provided.  The  results  and  ease  of  working  with  these  materials  makes  this  a  good 
area  for  further  research  to  develop  a  practical  associative  memory. 
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Chapter  1 
Introduction 


1.1  Background 

This  report  describes  research  continuing  from  a  summer  faculty  research  program 
sponsored  by  the  Air  Force  Office  of  Scientific  Research  performed  at  Wright  Patterson 
Air  Force  Base  A  ADO-2  [21].  New  luminescent  rebroadcaisting  devices  were  investi¬ 
gated  that  had  been  developed  under  funding  from  Wright  Patterson  Air  Force  Base. 
The  new  materials  could  result  in  a  breakthrough  in  write-read  memory  technolo¬ 
gies  because  resolution  approaches  that  of  optical  disk,  i.e.  0.6  ^m,  and  nanosecond 
speeds  seem  possible  [5].  In  particular,  we  explore  the  use  of  these  devices  for  asso¬ 
ciative  memory  (AM)  because  associative  memory  appears  to  be  suitable  for  taking 
advantage  of  this  potential  performance. 

Associative  memories  are  an  increasingly  important  computer  technology  because 
of  the  speed  obtainable  with  highly  parallel  search.  In  associative  memory,  data  is 
accessed  by  specifying  part  of  the  contents  of  the  data.  The  part  searched  for  in  the 
associative  memory  may  be  considered  the  input  and  the  data  associated  with  it  may 
be  considered  to  be  the  output.  When  the  output  includes  both  the  search  string  and 
associated  data  the  memory  is  often  referred  to  as  content  addressable. 

This  contrasts  with  random  access  memory  (RAM)  where  data  is  accessed  by 
specifying  memory  locations  or  addresses.  In  AM,  data  is  accessed  by  specifying  part 
of  the  contents.  A  phone  directory  provides  an  example.  In  the  case  of  RAM  the 
information  is  accessed  by  remembering  that  a  certain  persons  information  is  in  a 
specific  location  or  address,  usually  encoded  in  binary.  Thus  only  log2N  bits  will 
determine  one  of  N  locations  in  memory.  In  the  case  of  AM  the  persons  data  is 
accessed  by  providing  a  part  of  his  name  or  phone  number.  The  AM  then  provides 
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the  rest  of  the  information. 

RAM  is  widely  used  on  present  day  computers  because  it  is  much  cheaper  with 
existing  technologies.  It  is  cheaper  because  the  binary  encoding  reduces  the  number  of 
interconnections.  However,  this  results  in  serial  addressing  through  a  single  decoder. 

AM,  in  contrast  to  RAM,  operates  in  parallel  and  is  therefore  faster.  It  is  increas¬ 
ingly  used  in  modern  computers  where  speed  is  necessary,  for  example,  (a),  translation 
look  aside  buffers  for  rapidly  translating  most  recently  used  virtual  memory  addresses 
to  physical  memory  addresses,  and  (b),  cache  directories  for  rapid  determination  of 
whether  data  required  for  subsequent  computations  is  in  cache.  Another  reason  for 
investigating  AM  is  the  increasing  interest  in  high  speed  parallel  computing,  infor¬ 
mation  storage  and  retrieval  with  rapidly  changing  databases,  and  real-time  AI.  An 
advantage  of  AM  is  that  it  is  more  forgiving  because  only  part  of  what  is  sought  needs 
to  be  supplied.  In  this  respect  it  operates  more  like  human  memory. 

Human  experts  seem  to  use  associative  memory  (AM)  rather  than  rule  based 
systems  most  of  the  time.  The  expert  becomes  faster  as  past  experiences  accumulate. 
In  contraist,  rule  based  systems  become  slower  as  the  expertise  in  the  form  of  more 
rules  is  increased. 


1.2  Prior  optical  associative  memory  implemen¬ 
tations 

Most  optical  AM  proposed  and  constructed  for  experimentation  use  spatial  light 
modulators  (SLMs)  to  perform  correlation  or  equivalently,  matched  filtering,  of  the 
input  against  a  set  of  templates.  The  highest  correlation  is  used  to  classify  patterns. 
When  the  template  having  the  highest  correlation  directly  replaces  the  input,  as  is 
often  the  case  in  optical  designs,  the  system  is  referred  to  as  an  AM  [13], [1].  Parallel 
correlation  against  many  templates  may  be  performed  by  generating  multiple  images 
of  the  input  [13]  or  by  storing  many  templates  at  different  angles  in  a  hologram  [l]. 
Sequential  correlation  is  performed  by  presenting  the  templates  one  after  the  other 
in  a  single  correlation  scheme. 

Many  systems  in  optics  use  Fourier  transforms  because  convolution  is  transformed 
to  multiplication  and  the  Fourier  transform  is  easily  obtained  with  a  thin  lens.  A  Van 
der  Lugt  filter  is  used  [8]  because  the  transform  of  a  real  image  is  complex  and 
interference  with  a  plane  wave  is  a  holographic  method  of  storing  complex  functions. 

Symbolic  substitution  may  be  used  for  associative  memory  and  simple  experiments 
have  previously  been  performed  [3].  A  commonly  used  neural  network  structure  uses 
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matrix-vector  multiplication.  Many  simple  experiments  have  been  performed  using 
matrix- vector  multiplication[19],[25],[17],[28],[26]. 

In  this  project  we  explore  for  the  first  time  the  use  of  luminescent  rebroadcasting 
devices  for  three  associative  memory  designs:  multiple  correlations,  symbolic  substi¬ 
tution,  and  neural  networks.  Advantages  and  disadvantages  of  each  design  will  be 
discussed. 
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Chapter  2 

Device  characterization 


2.1  Samples  used 

Three  types  of  luminescent  rebroadcasting  samples  are  available  currently.  Sample 
A  is  a  2  by  2  inch  size  thick  film  in  a  polymer  encapsulation.  Sample  B  is  a  0.S5 
by  0.85  inch  thick  film  bonded  to  a  sapphire  window.  Sample  C  is  a  0.85  by  0.85 
inch  thin  film  bonded  to  a  sapphire  window.  Sizes  refer  to  active  surface  area.  Thick 
film  is  approximately  250  fim  and  thin  film  approximately  4  ^m.  The  approach  and 
results  of  experiments  follow  in  categories  of  resolution,  speed,  energy  consideration 
and  spectral  response. 


2.1.1  Resolution 

The  resolution  testing  procedure  was  to  first  illuminate  the  device  with  sufficient 
infrared  (IR)  energy  at  1064  nm  to  read  out  all  the  stored  energy.  A  resolution  chart 
wa^  attached  to  the  surface  of  the  device.  To  reach  saturation,  sufficient  energy  from 
a  blue  Argon  laser  at  488  nm  was  used  to  write  uniformly  in  space  on  the  device.  The 
chart  was  removed  and  IR  at  1064  nm  was  then  used  to  cause  luminescence.  Samples 
A  and  C  luminesce  in  orange,  while  sample  B  luminesces  in  green. 

Figure  2.1(a)  shows  the  result  for  the  polymer  encapsulated  thick  film,  sample 
A.  Based  on  the  center  left  of  this  chart  the  resolution  is  125  line  pairs  per  inch. 
This  corresponds  to  distinguishing  an  array  of  at  least  200  by  200  for  active  area  of 
sample  A.  Figure  2.1(b)  shows  the  result  for  the  thick  film  on  sapphire,  sample  B.  The 
resolution  is  higher,  at  approximately  175  line  pairs  per  inch.  This  would  permit  an 
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axray  of  150  by  150  to  be  distinguished  with  the  active  area  of  the  sample  provided. 

Figure  2.1(c)  shows  that  the  resolution  of  the  thin  film  is  much  higher  than  samples 
A  and  B,  at  over  275  line  pairs  per  inch.  This  corresponds  to  an  array  of  230  by  230 
for  the  active  area  of  sample  C.  The  improvement  in  resolution  for  the  thinner  sample 
is  obvious  from  Figure  2.1,  but  the  trade  off  is  that  the  luminescent  intensity  of  the 
orange  light  makes  the  thinner  sample  output  much  harder  to  detect. 


2.1.2  Speed 


The  free-electron  lifetime  determines  the  response  time  of  the  luminescence  signal  to 
the  stimulating  laser  beam.  It  has  been  proved  that  the  response  time  of  the  material 
is  less  than  50  ns  full  width  at  half-maximum[5]{2].  In  order  to  use  the  material 
for  computing  continuously,  it  is  necessary  to  know  the  photon  density  required  to 
significantly  clear  the  trap  during  the  short  exposure  time  to  the  IR  laser  beam. 
This  photon  density  combined  with  the  illuminating  .Ar  laser  power  will  decide  the 
cycle  time  for  writing  and  reading.  Generally  speaking,  the  time  needed  to  clean  the 
device  for  the  next  operation  is  longer  than  reading  from  or  writing  to  the  device.  A 
preliminary  experiment  concerning  the  reading  and  writing  speed  is  reported  in  [2l]. 


2.1.3  Energy  consideration 


Figure  2.2  shows  an  experimental  result  for  the  energy  response  of  the  device  using  the 
encapsulated  thick  film,  sample  A.  The  sample  was  exposed  under  different  energies 
of  the  writing  beam  and  the  reading  beam  power  remained  constant.  Reading  occurs 
following  each  writing.  The  sample  was  cleared  completely  after  each  reading.  The 
detector  was  positioned  along  a  constant  angle  with  respect  to  normal.  Thus  output 
power  readings  between  samples  are  relative.  Also,  output  power  is  the  reading  taken 
just  when  the  read  beam  Wcis  turned  on.  A  well  defined  linear  area  was  found  where 
arithmetic  operations  could  be  performed  [15].  The  same  experiment  was  conducted 
on  the  thin  sample,  sample  C.  A  similar  result  is  plotted  in  Figure  2.3  except  that 
the  luminescence  of  the  thin  sample  was  about  one  order  of  magnitude  lower  than 
the  thick  sample,  which  is  believed  due  to  the  contribution  of  thickness. 

A  thicker  sample  can  store  more  energy  in  the  same  spatial  location  than  the 
thinner  sample  resulting  in  a  higher  energy  requirement  of  saturation.  Once  the 
saturation  is  reached  the  excess  writing  energy  will  penetrate  through  the  sample 
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(a) 


(b) 


(c) 

Figure  2.1:  Resolution  tests:  (a)  for  thick  film  polymer  encapsulated  material  (sample 
A),  (b)  for  thick  film  on  sapphire  (Sample  B),  (c)  for  thin  film  on  sapphire  material 
(Sample  C). 
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Relative 


Intensity  (nW) 


Figure  2.2;  Luminescent  intensity  vs  input  energy  density  on  thick  sample 
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Relative 
Intensity  (nW) 


Figure  2.3:  Luminescent  intensity  vs  input  energy  density  on  thin  sample 
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and  logic  operations  caji  be  performed  in  this  energy  region.  Figure  2.4  and  Figure 
2.5  show  the  luminescent  intensity  curve  from  maximum  to  its  stable  state  for  both 
thick  and  thin  sample.  The  thin  sample,  as  expected,  reached  its  stable  state  faster 
than  the  thick  sample.  This  implies  that  the  thin  sample  is  more  suitable  for  fast 
logic  operations. 


2.1.4  Spectral  response 

All  samples  responded  to  blue  and  green  for  writing.  Samples  A  and  C  emitted 
orange  and  sample  B  emitted  green.  The  luminescence  emitted  by  the  thin  film 
material  sample  C,  appeared  to  be  one  order  of  magnitude  less  than  that  for  the 
thick  films.  However,  high  speed  [5]  and  resolution,  comparable  to  optical  disc,  are 
anticipated.  Devices  with  different  dopants  have  different  spectral  characteristics[ll] 
and  quantum  efficiencies  of  photoluminescence(12]. 


2.2  Arithmetic  operations 

Analog  addition.  Analog  addition  of  images  A,  B,  C  etc.  is  performed  by  writing 
one  image  after  the  other  onto  a  device  that  was  previously  cleared  by  flooding  with 
IR[16].  The  images  may  be  considered  to  be  2-D  arrays  of  elements  whose  intensities 
represent  analog  numbers.  The  array  elements  are  then  added  in  parallel.  The  stored 
energy  may  be  written  as 


S  =  A  +  B  +  C  ■■■■  (2.1) 

On  reading  the  device  by  flooding  with  IR,  the  intensity  of  the  luminescence  at  a 
pixel  is  the  sum  of  the  energies  supplied  to  the  pixel  position  by  each  of  the  images 
added,  provided  the  operation  remains  in  the  linear  range  of  the  device.  Figure  2.6 
shows  the  result  of  exposure  of  the  device  to  a  collimated  beam  at  two  different  po¬ 
sitions  foi  equal  time  such  that  the  center  gets  exposed  to  two  times  the  energy  of 
either  side.  Note  that  the  intensity  of  the  intersection  is  about  twice  as  bright  as  the 
remaining  areas. 


Analog  multiplication.  Parallel  multiplication  of  images  A  and  B  is  performed 
by  writing  one  image,  say  A,  onto  the  device  with  spatially  varying  intensities  rep¬ 
resenting  one  image  and  then  reading  with  IR  having  spatially  varying  intensities 
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Figure  2.6:  Dlustrating  analog  addition  with  luminescent  rebroadcasting  device. 

representing  the  second  image  B.  The  output  luminescence  at  each  pixel  position  is 
proportional  to  the  analog  product  of  the  read  intensity  at  that  pixel  and  the  stored 
value  at  that  pixel 

Out  =  AB  (2.2) 

Figure  2.7  illustrates  analog  multiplication  by  reading  a  previously  uniformly  writ¬ 
ten  device  through  a  variable  density  beam  splitter.  Note  the  stepped  intensities  are 
due  to  different  values  of  operand  B. 

The  energy  that  wais  not  read  out  remains  for  subsequent  operations  and  equals 

5  =  A-(A-5)  =  A-(l-B)  (2.3) 

The  read  out  energy  A-B  is  always  less  than  or  equal  to  the  stored  energy  A,  hence 
the  value  B  in  equation  2.3  is  less  than  or  equal  to  1.  Consequently,  the  device  must 
be  cleared  before  further  multiplication. 


2.3  Logic  operations 

The  device  can  perform  a  sequence  of  OR  operations  on  a  sequence  of  binary  arrays 
by  writing  the  arrays  sequentially  onto  the  device  with  sufficient  energy  to  cause  the 
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Figure  2.7:  Illustrating  analog  multiplication  with  luminescent  rebroadcasting  device. 

device  to  saturate.  From  equation  2.1,  addition  is  now  interpreted  as  OR.  The  device 
can  also  perform  a  sequence  of  AND  NOTs  by  successive  reads  with  binary  arrays  B, 
C,  D,  etc.  cLS  indicated  by  equation  2.3  and  eissuming  that  A  is  an  array  of  ones. 

S  =  (2.4) 

Multiplication  is  now  interpreted  as  AND.  As  B-C  is  equivalent  to  {B  +  C)  or 
NOR,  the  system  is  capable  of  performing  a  sequence  of  NORs. 

In  some  cases,  the  ability  to  perform  alternative  logic  operations  leads  to  more 
eflBcient  implementation  than  using  a  simpler  set.  Table  2.1  shows  14  basic  logic 
operations  between  two  inputs(10].  Two  trivial  operations  are  e.xcluded  in  which  the 
output  is  zero  for  any  input  and  for  which  the  output  is  one  for  any  input.  The 
lower  part  of  the  table  shows  the  sequence  of  incoherent  writes  and  reads  required  to 
perform  the  corresponding  logic  operations  with  a  single  luminescent  rebroadcasting 
device.  Exceptions  follow:  generating  exclusive  OR  (XOR)  and  exclusive  OR  negation 
require  either  coherent  inputs  out  of  phase  by  180  degrees,  or,  like  NAND,  require 
multiple  rebroadcasting  devices.  For  the  NAND,  one  device  is  needed  to  provide  a 
complement  of  B.  “All”  refers  to  saturating  the  device  all  over.  The  “tn”  where  n  = 
1,2,3,  or  4  refer  to  the  exposure  time  sequence.  Note  that  all  the  basic  logic  functions 
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may  be  performed  in  four  or  less  time  steps. 

Every  complete  logic  set  involves  inversion.  This  may  be  performed  by  flooding 
the  device  all  over  to  saturation.  The  pattern  whose  inversion  is  sought  is  used  to 
spatially  modulate  the  IR  in  order  to  remove  energy  from  the  device  where  the  pattern 
is  one.  Those  parts  of  the  device  not  read  out  may  now  be  read  out  with  a  uniform 
IR  beam  and  represent  the  inverse  of  the  pattern. 

Figure  2.8  shows  the  results  of  14  basic  logic  operations  between  two  input  patterns 
A  and  B.  If  we  encode  the  patterns  to  their  corresponding  binary  bit  value  as  suggested 
in  Figure  2.8,  then  all  of  those  14  Boolean  logic  operations  can  be  done  within  four 
steps.  Figure  2.9  shows  an  optical  set  up  for  implementing  those  operations.  The 
apparatus  is  arranged  in  such  a  way  that  the  Ar  laser  and  IR  laser  come  from  different 
angles,  which  reduces  the  complexity  of  the  experimental  procedures. 
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Table  2.1:  Computing  16  basic  Boolean  logic  functions  (except  all  O’s  and  all  I’s) 
with  luminescent  rebroadcasting  device 
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A*8 


A+3 


A  •  3 


Figiu-e  2.8;  Computing  16  basic  Boolean  logic  functions  (except  ail  O’s  and  ail  I's) 
with  luminescent  rebroadcasting  device 


62-18 


CHAPTER  2.  DEVICE  CHARACTERIZATION 


S;  Shiuter,  BS:  Beam  Splitter,  M:  Minor 
Ar.  Argon  Laser,  IR:  Infared  Laser 
LR:  Luminescent  Rebroadcasting 


Figure  2.9:  Optical  set  up  for  arithmetic  and  logic  operations 
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Optical  LR  associative  memory 


3.1  Correlation  with  LR  devices 

3.1.1  Optical  dynamic  matched  filtering 

Correlation  in  space  domain 

Shadow  casting  is  an  effective  means  of  performing  correlation[18]  in  the  space  do¬ 
main  [10].  The  computation  performed  is 

n  m 

^  3;,  Jj/.J  (3.1) 

.=1 }=i 

The  procedure  follows.  The  luminescent  rebroadcasting  material  is  cleared  with 
sufficient  exposure  to  IR  and  kept  in  the  dark.  Argon  light  is  used  to  shadow  cast 
the  first  image  from  a  piece  of  film  onto  the  material,  Figure  3.1(a). 

Writing  occurs  only  where  light  passes  through  the  film.  An  IR  laser  is  used  to 
pass  light  through  the  second  image  on  a  second  piece  of  film  so  as  to  cast  a  shadow  of 
the  second  image  onto  the  material,  Figure  3.1(b).  Only  where  the  IR  strikes  regions 
where  electrons  are  stored  will  the  material  luminesce.  A  lens  is  used  to  collect  the 
luminescent  light.  Consequently,  the  output  intensity  is  greater  for  matching  images. 
For  example  Figure  3.1(b)  provides  a  greater  intensity  out  than  Figure  3.1(c)  assuming 
input  image  areais  are  properly  normalized. 

Figure  3.2(a)  shows  the  results  of  correlating  an  X  with  itself.  Figure  3.2(b) 
shows  the  results  of  correlating  an  X  with  an  0.  Only  the  overlapping  regions  provide 
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Figure  3.1:  Correlation  in  spatial  domain  by  shadow  casting 
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luminescence.  Figure  3.3  shows  the  output  intensity  of  correlation  between  X  with  X 
eind  X  with  0.  Note  the  much  higher  peak  intensity  of  correlation  between  X  and  X 
compared  to  X  and  0. 

However,  it  is  sometimes  impossible  for  multiplicative  correlation  to  distinguish 
between  two  similar  character  images  such  as  E  and  F.  To  avoid  confusion,  a  more 
sophisticated  experimental  procedure  was  designed  as  follows:  Figure  3.4  shows  that 
for  each  character  image,  two  masks,  one  positive  and  one  negative,  weie  placed 
side  by  side.  Argon  laser  light  was  passed  through  a  writing  mask  containing  both 
a  negative  and  positive  character  E.  .'\n  IR  laser  was  used  to  pass  light  through  a 
reading  mask  containing  both  a  negative  and  positive  character  F.  Since  positive¬ 
negative  pairs  of  F’s  were  projected  onto  negative-positive  pairs  of  E  images,  the 
luminescence  observed  was  the  bottom  horizontal  bar  and  represents  a  subtractive 
pattern,  or  difference,  between  the  images.  If  and  only  if  the  two  images  are  identical 
will  there  be  no  luminescence  detected.  Figure  3.5  presents  experimental  results  of 
this  subtractive  correlation  difference  between  an  E  and  an  F. 


Matched  filtering  in  the  frequency  domain 

It  is  preferable  to  filter  in  the  frequency  domain  because  objects  in  the  two  images 
to  be  correlated  do  not  have  to  line  up  in  translation,  i.e.  the  system  provides  shift 
invariance.  The  Fourier  transform  of  a  real  image  is  complex  so  that  complex  image 
information  must  be  stored  in  the  filter.  The  Van  der  Lugt  filter  provides  this  capabil¬ 
ity  [S].  However,  it  needs  an  order  of  magnitude  higher  resolution  materials  because 
the  phcLse  is  stored  by  the  position  of  a  high  spatial  frequency  carrier.  Spatial  light 
modulators  available  today  do  not  have  adequate  resolution  for  this.  Consequently, 
there  heis  been  much  activity  in  phase-only  filters  [9]. 

Luminescent  rebroadcasting  materials  are  considered  capable  of  achieving  high 
enough  resolution  for  storing  complex  functions.  However,  there  are  two  difficulties. 
First  the  write  and  read  frequencies  are  different.  Second,  present  samples  have  too 
much  back  scatter  to  preserve  coherence  of  light  passing  through  the  device.  The  first 
difficulty  can  be  overcome  with  proper  optical  set  up.  The  second  problem  requires 
different  manufacturing  methods  and  is  currently  under  investigation  by  Quantex, 
the  manufacturer  of  the  materials. 


62-22 


CHAPTER  3.  OPTICAL  LR  ASSOCIATIVE  MEMORY 


(a) 


(b) 


Figure  3.2:  Experimental  results  showing  shadow  casting  results  with  luminescent 
rebroadcasting  material:  (a)  X  with  X,  (b)  X  with  0 
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%  intensity 


Figure  3.3:  Experimental  results  showing  the  correlation  results  of  X  with  X  and  X 
with  0 
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Figure  3.4;  Shadow  casting  between  E  and  F  with  luminescent  rebroadcasting  device 
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(a) 


(b) 


(c) 

f*  igure  3.5:  Experimental  results  showing  subtractive  shadow  casting  between  E  and  F 
with  luminescent  rebroadcasting  device:(a)image  E  {b)image  F  (c)pattern  subtraction 
between  E  and  F 
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3.2  Symbolic  substitution  with  LR  devices 

Search  with  symbolic  substitution 


With  looping 

The  luminescent  rebroadcasting  device  haa  a  useful  feature  in  comparison  with  other 
SLMs  for  associative  memory  search.  It  stores  the  current  array  for  successive  logic 
operations  of  the  types  discussed  in  equations  2.1  and  2.4.  Figure  3.6  shows  an  optical 
system  to  implement  word- parallel,  bit-serial  associative  memory  search  with  looping. 

Lenses  Lq  through  Ls  are  for  the  purpose  of  directing  and  collimating  the  beam 
through  the  system,  and  cylindrical  lenses  RLi,  RL2,  RL3  are  used  for  2-D  parallel 
search.  Two  luminescent  rebroadcasting  devices  LRi  and  LR2  are  used  for  storing 
the  database  while  electronically  settable  spatial  light  modulators  ESLMi  and  ESLM2 
serve  as  lO(input-output)  devices.  The  database  is  entered  on  ESLMi  with  associated 
data  entered  on  the  same  row.  For  simplicity,  frames  are  considered  of  fixed  bit  length 
and  matching  is  described  for  which  the  search  bit  pattern  is  exactly  one  frame  long. 
Blue  Ught  is  used  to  image  the  data  onto  luminescent  rebroadcasting  device  LRi.  The 
inverse  is  imaged  onto  LR2  so  that  inversion  is  not  required  at  each  iteration  of  the 
loop  shown.  This  avoids  clearing  the  device  frequently  which  takes  a  long  time,  cis 
discussed  earlier. 

Associative  memory  search  involves  performing  logic  between  the  bit  pattern 
stored  in  LRi  or  LR2  and  the  output  from  a  previous  logic  operation  that  is  fed 
back  via  mirrors  so  that  it  is  misaligned  by  one  column.  The  feedback  path  has  an 
image  intensifier  that  provides  gain  and  converts  light  frequency  from  orange  to  IR. 
Consequently,  the  parallel  logic  operation  performed  is  AND  between  the  array  fed 
bax:k  and  that  stored. 

The  location  of  a  linear  pattern  of  bits  may  be  found  in  parallel  anywhere  on 
the  2-D  array  LRj  by  entering  the  pattern  to  be  matched  in  ESLMi  and  the  inverse 
pattern  in  ESLM2.  As  an  example,  suppose  the  pattern  to  be  found  in  the  data  is  a 
string  of  ones.  The  array  ANDed  with  itself  one  column  right  will  now  have  ones  out 
only  where  there  were  two  ones  adjacent  horizontally.  A  further  iteration  involving 
a  further  shift  right  and  an  AND  will  result  in  an  output  that  has  ones  only  where 
there  were  three  adjacent  ones  in  a  row.  This  may  be  repeated  a  number  of  times 
equal  to  the  length  of  the  pattern  sought.  For  a  search  pattern  having  ones  and  zeros 
the  beam  path  is  controlled  such  that  the  light  will  pass  through  the  luminescent 
rebroadcasting  device  holding  the  inverse  LR2  whenever  a  zero  occurs  in  the  search 


62-27 


CHAPTER  3.  OPTICAL  LR  ASSOCIATIVE  MEMORY 


Figure  3.6:  Parallel  optical  associative  memory  search  with  looping 
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pattern  and  to  the  device  holding  the  data  LRi  whenever  a  one  occurs  in  the  search 
pattern. 

The  output  will  be  one  in  the  right  most  position  of  a  frame  only  if  the  correct 
pattern  is  observed,  moreover,  there  will  be  a  one  at  the  right  most  position  for  every 
instance  of  the  pattern  on  the  whole  array.  In  an  associative  memory  application  the 
data  corresponding  to  the  lines  with  matching  patterns  is  read  out. 


Without  looping 

The  search  mechanism  described  above  involved  a  feedback  loop.  In  order  to  maintain 
sufficient  intensity  of  the  light  beam  in  consecutive  loops,  it  is  necessary  to  use  an 
image  intensifier.  However,  it  is  difficult  to  manufacture  an  image  intensifier  working 
from  visible  to-IR.  Thus  a  design  for  associative  memory  search  without  looping  was 
tested  and  the  results  are  presented  here.  The  basic  idea  here  is  the  same  as  correlation 
in  the  space  domain.  However,  the  search  pattern  is  composed  of  binary  digital  bits. 

Figure  3.7  shows  the  optical  set  up  of  the  design.  LR]  and  LR2  store  the  2-D 
database  and  the  inverted  2-D  database,  respectively.  ESLMi  stores  the  linear  search 
pattern  and  ESLMj  stores  the  inverted  linear  search  pattern.  LR]  and  LR2  recorded 
the  databaise  before  the  search  takes  place.  The  cylindrical  lens  spreads  the  IR  in 
the  horizontal  direction  through  ESLMi  and  ESLM2.  By  precise  adjustment  of  the 
mirrors  those  two  IR  wavefronts  will  meet  in  front  of  the  imaging  lens.  Any  matched 
pattern  in  the  database  will  show  a  full  range  of  bright  spots  in  the  imaging  plane. 
Figure  3.8  presents  the  results  of  searching  an  8-bit  binary  pattern.  Note  the  matching 
of  the  fourth  row  of  the  database  results  in  an  eight-bright-spots  pattern  in  the  image 
plane,  only  partial  or  no  bright  spots  appear  in  unmatched  rows. 

Substitution  with  LR  devices 

It  is  useful  when  modifying  the  database  of  an  associative  memory  to  be  able  to 
replace  a  pattern  wherever  it  occurs  in  a  2-D  array  with  cinother  pattern  [20].  Fig.  3.9 
shows  an  optical  system  providing  this  capability  for  linear  patterns.  The  IR  feedback 
permits  searches  for  a  horizontal  pattern  of  consecutive  bits  as  before.  After  finding 
all  instances  of  a  pattern  of  bits  device  LRl  is  cleared  by  flooding  with  IR  and  the 
output,  indicating  positions  of  the  sought  data,  is  used  to  write  on  LRl  via  the  blue 
feedback  path. 

The  blue  feedback  is  then  used  to  stretch  the  isolated  ones  into  a  string  of  ones 
by  writing  to  adjacent  pixels  at  each  iteration.  The  length  of  this  string  is  that  of  the 
substitution  pattern.  A  linear  array  of  ones  occurs  on  LRl  only  where  the  original 
array  matched  the  sought  pattern.  The  replacement  pattern  repeated  in  every  frame 
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LR:  Luminescent  Rebroadcasting  Material 
ESLM:  Electrically  Addressable  Spatial  Light  Modulator 
L;  Collimating  Lens  SP;  Spatial  Filter 

CL;  Cylindrical  Lens  BS;  Beam  SplittCT 

L^ :  Imaging  Lens  M:  Mirror 

P;  Image  Plane 

Figure  3.7:  Parallel  optical  associative  memory  search  without  looping 
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(a) 


(b) 


(c) 

Figure  3.8:  Experimental  results  showing  associative  memory  search  without  looping 
(a)searched  pattern  (b)clatabase  (c)result 
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Figure  3.9:  Symbolic  substitution  for  search  and  replacement 

is  entered  on  the  ESLM  and  IR  used  to  image  this  onto  device  LRl.  Due  to  the  .\N’D 
operation  the  output  now  has  the  replacement  pattern  only  in  those  locations  where 
a  match  was  found  earlier.  The  result  is  stored  in  device  LR4.  The  pattern  stored 
on  LRl,  having  ones  only  where  the  sought  pattern  occured,  is  read  out  by  flooding 
with  IR  and  is  now  stored  in  device  LR3.  This  is  replaced  by  its  inverse  at  LR3  by 
flooding  LRl  with  blue,  reading  LR3,  converting  to  IR  and  reading  LRl,  and  then 
reading  out  the  inverse  in  LRl  into  LR3.  LRl  can  be  left  cleared.  The  original  data 
is  now  entered  into  the  ESLM  and  imaged  onto  device  LRl  using  blue  light.  Device 
LR3  is  illuminated  with  IR  and  the  output  converted  to  IR  to  read  device  LRl.  The 
stored  array  on  LRl  is  replaced  with  the  original  pattern  modified  to  have  zeroes 
wherever  a  pattern  existed  that  matched  that  sought.  Device  LR4  is  illuminated  and 
the  output  converted  to  blue  in  order  to  OR  with  that  stored  on  LRl.  The  result 
on  LRl  is  the  original  pattern  with  the  new  pattern  wherever  the  original  pattern 
matched  the  sought  pattern.  This  may  be  used  for  future  operations  or  read  out  with 
uniform  IR  illumination  to  the  detector  array. 
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Hidden 

Layer 

(D 


Figure  3.10:  Neural  network  considered 

3.3  Matrix- vector  multiplication  for  neural  net¬ 

works 

Neural  network  schemes  cire  aimed  at  providing  more  flexible  analog  processing  by 
means  of  learning.  A  popular  nonlinear  network  is  considered  with  J  neurons  in  the 
output  layer,  I  hidden  neurons,  and  K  neurons  in  the  input  layer,  Figure  3.10. 
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The  equations  implemented  by  this  network  are 


for  j  =  1  to  J 


where 


/(5,)  =  (l  +  e-^0"^ 


(3.2) 

(3.3) 


This  equation  can  be  implemented  using  matrix-vector  multiplication.  An  optical 
design  for  matrix-vector  multiplication  with  LR  devices  will  be  presented  in  the  fol¬ 
lowing  sections. 


3.3.1  Analog  matrix-vector  multiplication  with  LR 

To  multiply  a  vector  by  a  matrix,  an  astigmatic  optical  processor  can  be  used[4][31][7][14]. 
The  principle  of  analog  matrix- vector  multiplication[6][22]  is  shown  in  Figure  3.11. 
Prior  to  the  multiplication,  the  analog  values  of  the  matrix  are  recorded  on  the  lu¬ 
minescent  rebroadcasting  device,  A  low  sensitivity  film  with  different  grey  levels  of 
transparency  is  created  for  writing.  The  film  is  illuminated  by  an  argon  laser  to  record 
the  analog  values  of  the  matrix  onto  the  LR  device.  The  illumination  energy  must 
be  within  the  linear  operating  region  of  the  device  in  order  to  perform  arithmetic 
operations.  The  vector  is  represented  by  a  row  of  different  grey  levels  on  a  film  trans¬ 
parency.  The  cylindrical  lens  Li  (Figure  3.11)  spreads  the  IR  uniformly  in  the  vertical 
direction,  thus  projecting  the  different  analog  values  of  the  elements  in  the  vector  to 
the  LR  device.  Since  the  matrix  values  have  been  stored  in  the  LR  device  earlier,  the 
output  luminescence  now  represents  the  resulting  analog  values  of  the  matrix-vector 
multiplication  (Section  2.2).  The  cylindrical-spherical  lens  combination  Lo  images 
the  results  of  multiplication  and  integrates  horizontally,  providing  an  output  array 
of  strips  along  a  vertical  line  normal  to  the  optical  axis  of  the  system  thus  forming 
sums  of  products.  Figure  3.12  shows  the  details  of  matrix-vector  multiplication  while 
Figure  3.13  presents  experimental  results  of  a  3x3  matrix- vector  multiplication.  The 
different  grey  levels  represent  different  analog  values  of  the  result. 

Within  the  resolution  of  the  device  it  is  possible  to  perform  matrix-vector  multi¬ 
plication  to  a  much  higher  order.  Figure  3.14  demonstrates  the  experimental  result 
of  a  10x10  matri.x-vector  multiplication. 
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Figure  3.11:  Optical  set  up  of  matrix 


I 


-vector  multiplication  with  LR  device 
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c  =  a  b 
1  11  1 


a  b, 


a  b 


12  2  +  13  3 


^  2  =  b,  ^  ^.b 


21  1 


22  2  +  23  3 


=  a  b.  .  a^  b^  .  a^^b 
3  31  1  ^  32  2  +  33  3 


Figure  3.12:  Principle  of  matrix-vector  multiplication 
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(b) 


(c) 

Figure  3.13:  Experimental  result  of  a  3x3  matrix-vector  multiplication  (a)  vector  (b) 
matrix  (c)  result  of  multiplication 
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Figure  3.14:  Experimental  result  of  a  10x10  matrix- vector  multiplication  (a)  vector 
(b)  matrix  (c)  result  of  multiplication 
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Chapter  4 

Comparison  of  AM  approaches 


Laboratory  approaches  for  three  proposed  AM’s  are  presented  in  chapter  3.  Each 
of  the  approaches  has  its  advantages  and  disadvantages.  Generally  speaking,  all  of 
the  approaches  take  advantage  of  the  high  resolution(0.6  /i),  high  speed(nsec)  and 
simple  operating  procedures  of  the  LR.  The  disadvantage  of  the  devices  is  that  there 
is  no  gain  so  it  is  necessary  to  use  an  image  intensifier  in  feedback  optical  systems  to 
compensate  for  energy  otherwise  lost  in  propagating  through  the  system.  The  three 
different  wavelengths  involved  also  pose  problems  for  feedback  systems. 


4.1  Advantages  and  disadvantages  of  correlation 
AM  using  LR  devices 

4.1.1  Advantages 

Optics  has  a  distinct  advantage  in  speed  for  multiple  correlations  of  2-D  images 
because  the  correlation  for  an  image  is  pefrformed  in  parallel.  The  high  resolution, 
memory  behavior,  and  high  speed  of  LR  devices  makes  them  suitable  for  highly 
parallel  dynamic  correlators  or  matched  filters. 

In  chapter  3  section  3.1  we  described  some  experimental  results  for  correlation  AM 
using  LR  devices  in  the  space  domain.  Because  of  the  fast  reading  and  writing  one 
can  perform  fast  spatial  correlation  between  a  series  of  images  against  a  previously 
recorded  image. 
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4.1.2  Disadvantages 

In  order  to  perform  correlation  in  the  frequency  domain  for  shift  invariance,  it  is 
necessary  to  store  complex  images  in  a  filter.  The  high  resolution  of  LR  devices 
makes  it  possible  to  store  complex  images  on  it.  Correlation  in  the  frequency  domain 
requires  a  transmissive  or  reflective  storage  medium  for  which  the  output  relative 
phase  relationship  is  preserved.  The  LR  devices  available  today  provide  an  incoherent 
orange  output  and  a  scattered  IR  output  because  they  do  not  have  a  clear  transmissive 
capability.  The  scattering  of  the  IR  light  from  the  back  of  the  devices  make  it  very 
hard  to  observe  the  correlation  results.  This  is  the  main  reason  that  the  correlation 
was  performed  in  the  space  domain  for  this  project.  The  manufacturer  is  conducting 
reseaxch  on  transmissive  LR  devices  and  once  the  device  is  available,  correlation  in 
the  frequency  domain  should  be  fairly  easy  to  perform  based  on  our  e.xperiences. 


4.2  Advantages  and  disadvantages  of  symbolic  sub¬ 
stitution  AM  with  LR  devices 

4.2.1  Advantages 

Symbolic  substitution  differs  from  neural  networks  in  that  the  information  to  be 
stored  is  entered  directly  rather  distributed  in  the  database  cis  a  result  of  learning. 
The  difference  between  the  search  phcise  of  symbolic  substitution  and  correlation  is 
that  a  digital  technique  is  used  so  that  an  exact  match  to  the  input  is  made.  However 
only  part  of  the  information  sought  need  be  supplied,  e.g.  a  few  consecutive  letters 
anywhere  in  a  name  is  sufficient  to  find  all  names  in  a  database  having  this  sequence 
of  letters.  Hence,  the  inexactness  in  addressing  hais  a  different  form  to  a  correlation 
system  where  an  average  correctness  is  required  for  the  input. 

Two  optical  systems  designed  to  perform  the  search  pheise  of  symbolic  substitution 
were  presented  in  chapter  3,  section  3.2.  The  biggest  advantage  of  these  systems  is  the 
ability  to  perform  parallel  search  everywhere  in  the  database.  The  length  of  the  search 
pattern  with  the  current  designs  is  unlimited  as  long  as  a  suitable  image  intensifier 
is  available.  After  the  search  pattern  is  found  one  can  substitute  the  desired  pattern 
in  a  parallel  fashion.  With  the  fast  reading  and  writing  ability  of  LR  devices  one  can 
search  and  substitute  different  databases  in  a  rapid  way. 
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4.2.2  Disadvantages 

Optical  symbolic  substitution  involves  looping  during  searching  and  substitution.  As 
the  length  of  the  search  pattern  increases,  the  number  of  loop  iterations  is  increased 
«md  the  intensity  of  light  will  decrease  very  quickly.  To  insure  proper  operations  it  is 
necessaxy  to  incorporate  an  image  intensifier  inside  the  loop.  The  intensifier  is  needed 
to  convert  luminescent  light  to  IR  or  to  enhance  the  input  IR.  Image  intensifiers  that 
meet  the  required  specifications  were  not  available  for  this  effort.  Available  night- 
vision  intensifiers  work  in  the  opposite  way,  i.e.  they  convert  IR  to  visible  light. 
Without  a  proper  image  intensifier  the  length  of  the  search  pattern  is  constrained.  A 
binary  subtractive  digital  correlator  performed  similarly  to  a  spatial  correlator  and 
avoided  the  looping.  However  a  row  of  spots  must  now  be  detected  rather  than  a 
single  spot  for  each  match. 

4.3  Advantages  and  disadvantages  of  neural  net¬ 
work  AM  with  LR  devices 

4,3.1  Advantages 

A  computer  aided  design  example  in  structures  wais  illustrated  by  the  principal  in¬ 
vestigator  in  Ref.  [23].  The  AM  provides  good  starting  estimates  for  more  accurate 
subsequent  nonlinear  parameter  estimation  in  design  [29], [30]  or  in  other  nonlinear 
parameter  estimation  situations  [27], [24].  Similiarly  in  complex  real-time  situations, 
such  as  the  battle  field,  AM  is  expected  to  be  faster  than  rule  based  systems. 

The  neural  network  AM  differs  from  correlation  by  using  learning  to  determine  the 
distribution  of  information  stored  rather  than  explicitly  storing  the  information.  This 
permits  adaption  to  the  environment  and  possibly  less  requirement  by  the  developer 
to  understand  the  nature  of  the  information  stored.  The  goal  of  machine  learning 
is  to  place  a  greater  computational  burden  on  the  machine  in  order  to  decrease  the 
effort  by  the  human  developer.  A  layer  of  the  network  in  Fig.  3.10  was  implemented 
using  LR  devices  by  means  of  an  analog  matrix-vector  multiplier.  Nonlinearity  may 
be  obtained  from  the  natural  saturation  of  the  device.  The  optical  system  and  the 
results  were  described  in  chapter  3,  section  3.3.  With  the  design  shown  in  Fig. 
3.11,  the  dimension  of  the  matrix  is  only  limited  by  the  physical  size  of  the  lenses.  A 
100x100  matrix-vector  multiplication  is  currently  under  investigation  by  the  principal 
investigator. 

The  application  of  matrix- vector  multiplication  in  neural  networks  is  simpler  than 
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for  normal  zirithmetic  computation.  First,  the  detectors  do  not  requires  a  large  dy¬ 
namic  range  because  the  network  uses  a  clipping  function  at  each  stage.  Secondly, 
high  uniformity  across  the  array  is  not  needed  because  the  learning  will  compensate 
for  spatial  differences.  Thirdly,  the  weights  representing  values  to  be  set  on  the  LR 
device  «ire  adjusted  rather  than  rewritten.  Therefore,  it  is  not  necessary  to  clear  the 
device  which  is  time  consuming  for  these  devices.  Further,  the  choice  of  wavelength 
determines  whether  the  value  is  adjusted  up  or  down. 


4.3.2  Disadvantages 

The  speed  of  multiplication  is  limited  by  the  speed  of  the  electo-optic  interface  de¬ 
vices.  The  advantage  of  optics  over  electronics  is  greater  for  larger  matrices  because 
the  optics  has  greater  parallelism  in  the  processing  step.  Larger  matrix-vector  multi¬ 
plication,  such  as  500  by  500,  is  possible  with  the  same  optical  approach  and  materials. 

In  order  to  have  more  than  one  layer  it  is  necessary  to  have  devices  for  which  the 
output  frequency  for  one  is  a  match  to  the  input  frequency  of  the  other.  Such  devices 
are  possible  but  have  not  yet  been  constructed.  A  more  expensive  alternative  is  to 
have  a  frequency  conversion  unit  such  as  an  image  intensifier.  Such  intensifiers  are 
not  presently  available. 
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Chapter  5 
Conclusion 


Associative  memory  is  becoming  increasingly  important  in  computer  science  because 
of  the  need  for  larger  memories  with  fast  recall.  Three  approaches  to  optical  associa¬ 
tive  memory  using  luminescent  rebroadcasting  devices:  correlations,  neural  networks 
and  symbolic  substitution  were  described.  Experimental  results  and  the  comparison 
of  advantages  and  disadvantages  of  each  approach  were  presented. 

Symbolic  substitution  is  useful  where  a  part  of  the  data  must  be  matched  exactly 
while  correlation  allows  an  approximate  fit  to  data.  Neural  networks  are  useful  where 
interpolation  or  extrapolation  is  required  or  where  adaption  is  needed.  For  example, 
where  the  search  is  for  strings  that  are  between  those  stored.  Further  an  associated 
computer  is  assumed  for  performing  learning. 

The  performance  of  the  optical  systems  is  severely  limited  by  the  availability  of 
optical  devices.  Without  a  transmissive  LR  device,  correlation  is  limited  to  the  space 
domain  which  cannot  provide  shift  invariance  as  would  the  frequency  domain.  With¬ 
out  an  image  intensifier  for  converting  frequency  and  providing  gain  the  symbolic 
substitution  AM  is  constrained  by  the  length  of  the  search  pattern.  The  development 
of  transmissive  LR  devices  and  suitable  image  intensifiers  is  under  investigation.  Al¬ 
though  very  large  matrix-vector  multipliers  are  possible  using  LR  devices,  in  order  to 
produce  a  neural  network  with  more  than  one  layer,  matching  device  pairs  or  suitable 
frequency  changing  intensifiers  are  needed. 

Progress  in  device  technologies  suggests  that  all  of  the  approaches  considered 
may  eventually  be  used  for  associative  memory  according  to  the  requirements  for  the 
system.  Therefore,  the  development  of  devices  and  systems  for  using  such  devices  is 
strongly  encouraged. 
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ABSTRACT 


It  was  desired  to  develop  an  algorithm  for  the  automated  translation  of  finite 
state  machines  from  state  table  form  to  optimized  VHDL  form.  To  do  this, 
algorithms  are  needed  for  reducing  the  state  machine  to  simplest  form,  making  state 
assignments,  producing  minimal  logic  equations  to  represent  the  state  machine,  and 
producing  VHDL  code  which  describes  the  intended  circuit.  Various  such  algorithms 
were  examined  and  a  prototype  program  written  to  perform  this  translation. 
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A.  INTRODUCTION 


It  was  desired  lo  develop  an  algorithm  for  the  automated  translation  of  finite 
state  machines  from  state  table  form  to  optimized  VHDL  form.  The  translation 
process  involves  reducing  the  state  machine  to  its  simplest  form,  making  state 
assignments,  producing  minimal  logic  equations  to  represent  the  state  machine,  and 
producing  VHDL  code  which  describes  the  intended  circuit. 

A  finite  state  machine  is  a  model  of  a  sequential  logic  network.  The  term 
sequential  indicates  that  its  outputs  are  dependent  not  just  on  its  current  inputs  but 
also  on  past  inputs.  This  is  accomplished  by  use  of  a  memory.  Rather  than  attempt 
to  keep  track  of  all  past  inputs,  a  finite  number  of  states  are  used,  each  of  which 
represents  an  equivalent  set  of  input  histories.  Each  input  may  cause  the  machine  to 
enter  a  new  state  and  affect  its  output.  An  electrical  circuit  for  a  finite  state  machine 
includes  inputs,  a  combinational  logic  part,  a  memory,  and  outputs  as  shown  in 
figure  I . 
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Figure  1.  Sequential  Network 
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VHDL  is  a  hardware  description  language  intended  for  the  design,,  description, 
and  simulation  of  electrical  hardware  systems  and  components.  The  description  of  an 
object  is  in  two  parts,  an  interface  and  an  architecture.  This  allows  for  separation  of 
function  and  implementation.  For  versatility  objects  can  be  described  by  behavior, 
structure,  data  flow,  or  any  combination  of  the  three.  [11,  12]  See  appendix  A  for  a 
more  detailed  description. 

Only  completely  specified,  synchronous,  single  input/single  output  machines 
were  considered  for  the  translation  from  state  table  to  VHDL  form.  A  prototype 
program,  FSM,  (appendix  B)  to  perform  this  translation  was  written  using  Pascal  on 
an  IBM  PC.  The  following  sections  outline  algorithms  available  and  identify  those 
used  for  the  prototype  program.  Complete  examples  of  the  process  of  translating  a 
state  table  to  logic  equations  is  given  in  appendix  D. 
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B.  FROM  STATE  TABLE  TO  LOGIC  EQUATIONS 


1.  State  Table  Simplification. 

A  state  table  is  a  collection  of  transitions  between  states  and  the  outputs 
produced  either  at  the  state  (a  Moore  machine)  or  on  transition  between  states 
(a  Mealy  machine—used  by  FSM,  the  prototype  program)  [3].  Figure  2  shows  a 
transition  diagram  and  corresponding  state  table  for  a  fmite  state  machine. 


Input  for  the  prototype  program,  read  from  a  file,  includes  a  short  (80  character) 
description  of  the  finite  state  machine,  the  number  of  states  in  the  machine,  and  the 
state  transitions.  Each  transition  is  specified  by  its  next  state  and  associated  output. 
Since  only  completely  specified  single  input/single  output  machines  are  considered, 
there  are  exactly  two  transitions  for  each  state.  States  are  initially  assumed  to  be 
numbered  sequentially  starting  with  zero  which  is  assumed  to  be  the  initial  state. 
Additional  input  accepted  directly  from  the  user  consists  of  the  name  the  finite  state 
machine  is  to  be  given  in  the  VHDL  code,  the  name  of  the  file  containing  the  state 
transitions,  the  name  of  the  file  to  which  the  VHDL  code  is  to  be  written,  the  type  of 
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flipflop  to  use  and  its  delay  time,  and  the  implementation  of  the  combinational  logic 
and  its  delay. 

Reducing  the  number  of  states  in  a  state  machine  can  reduce  the  number  of 
memory  elements  needed  to  represent  the  states  of  the  machine  and  help  minimize  tl  j*, 
combinational  logic  used  to  determine  the  machine's  outputs  and  next  states.  To 
reduce  a  state  table  to  its  simplest  form,  unnecessary  states  must  be  removed.  These 
include  redundant,  unreachable,  and  equivalent  states.  As  redundant  states  are 
equivalent  states  they  need  not  be  considered  separately  although  algorithms  exisi 
their  removal.  Unreachable  states  however  can  only  be  equivalent  tc  other 
unreachable  states  and  must  therefore  be  handled  separately. 

Removal  of  unreachable  states  is  accomplished  by  creating  a  set  of  reachable 
states  [4].  Initially  only  a  known  reachable  state,  the  initial  state,  is  a  member  of  uiis 
set.  An  iterative  process  is  then  set  up  in  which  the  next  states  of  each  member  of 
the  reachable  set  are  added  to  the  set  (if  they  are  not  already  members).  When  no 
more  states  can  be  added,  the  set  is  complete  and  any  states  not  in  the  set  are 
unreachable  and  can  be  .removed  from  the  state  table.  References  to  these 
unreachable  states  as  next  states  of  reachable  states  need  not  be  considered  in  their 
removal  as  there  can  be  none. 

Removal  of  equivalent  states  can  be  handled  by  use  of  equivalence  sets  [I]  or  an 
implication  table  [3].  In  either  case  all  states  are  at  first  considered  to  be  equivalent 
and  equivalences  are  then  ruled  out.  When  the  equivalent  states  of  the  state  table 
have  been  identified,  all  but  one  of  the  states  in  each  group  of  equivalent  states  are 
removed  from  the  state  table  and  all  references  to  them  as  next  states  are  replaced  by 
the  state  kept. 
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a)  state  table  after  removal 
of  unreachable  states 
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b)  implication  table 
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c)  equivalence  sets 
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Figure  3.  Equivalent  State  Removal 
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With  an  implication  table  (figure  3b),  one  entry  exists  for  each  possible  pairing 
of  states  with  the  exception  of  pairing  a  state  with  itself.  A  mark  is  placed  in  an 
entry  when  that  pair  is  known  not  to  be  equivalent.  The  first  of  these  marks  are 
placed  on  the  basis  of  differing  outputs  of  the  states'  transitions  as  states  with 
differing  outputs  cannot  be  equivalent.  The  remaining  entries  are  then  checked  in 
repeated  passes  of  the  table  on  the  basis  of  the  states'  next  states.  When  a  pair  of 
next  states  for  an  entry  have  been  found  not  to  be  equivalent,  that  entry  can  be 
marked.  When  a  pass  yields  no  additional  marks,  the  remaining  unmarked  entries 
indicate  equivalent  states. 

When  using  equivalent  sets  (figure  3c),  the  states  are  first  divided  into  separate 
sets  according  to  the  outputs  of  their  transitions  to  next  states.  For  the  iterative  part 
of  this  process  the  states  in  each  set  are  assigned  a  subscript  for  each  transition 
indicating  which  set  its  next  state  is  in.  Each  set  is  then  broken  down  further  into 
new  sets  for  which  the  subscripts  of  all  member  states  match.  'This  is  repeated  until 
no  new  sets  are  created.  At  this  time,  each  set  contains  only  equivalent  states.  The 
use  equivalent  sets  was  chosen  over  an  equivalence  table  for  the  prototype  program 
because  of  the  simpler  data  structure. 

2.  State  Assignment. 

In  the  circuit  implementation  of  a  finite  state  machine  each  state  is  represented 
by  a  binary  n-tuple  which  is  a  concatenation  of  the  values  of  the  memory  elements 
when  the  machine  is  in  that  state,  n  being  the  number  of  memory  elements.  The 
choice  of  these  n-tuples,  or  state  assignments,  can  affect  the  minimization  of  the 
combinational  logic  part  of  the  circuit.  For  a  given  machine  there  are  2"  possible 
state  assignments.  Story  (10)  gives  the  number  of  possible  combinations  of 
assignments  as 
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(2”  -  1)! 

(2"  -  /?)!«! 

where  R  is  the  number  of  states  in  the  machine.  Thus  as  the  number  of  states  grows 
large,  the  number  of  possible  state  assignments  and  their  possible  combinations  grows 
very  large.  Currently,  there  is  no  method  for  determining  an  optimal  state  assignment 
without  comparing  the  results  of  assignments  through  trial  and  error.  Story  [10]  does 
offer  a  method  of  reducing  the  number  of  assignments  which  need  to  be  checked. 

The  prototype  program  uses  the  natural  assignment  method  which  consists  of 
numbering  the  states  sequentially  starting  with  zero. 

3.  Karnaugh  Map  Production. 

Karnaugh  map  representations  of  the  machine  outputs  and  next  state  signals  are 
created  to  help  in  the  production  of  the  logic  equations  (3).  Two  maps  are  required 
for  each  JK  or  RS  flipflop,  one  is  required  for  each  D  flipflop,  and  one  for  each  state 
machine  output.  The  prototype  program  can  produce  VHuL  descriptions  using  JK, 
RS,  or  D  type  flipflops.  Maps  are  represented  internally  by  a  list  of  on*cells  and  a  list 
of  don't-care-cells.  All  cells  not  listed  are  off.  Figure  4  shows  the  production  of  J 
and  K  maps  for  one  memory  element  of  a  finite  state  machine. 

Story  [10]  gave  formulas  for  finding  on-cells  and  don't-care-cells  for  the 
Karnaugh  maps  for  JK  flipflops: 

g-l  2"  R-\  2" 

y'=0  /=0  y=0  i—R 

«-l  2"  R-\  2” 

'f-  T/aTs'  -yi'^j + 

;=0  /=0  y=0  j=R 


63-8 


where  j  is  the  state  table  row  index,  i  is  the  input  index,  R  is  the  number  of  states, 
n  is  the  number  of  inputs,  m  is  the  number  of  flipflops,  y  and  y  represent  the  current 
and  next  state  values  of  the  flipflop,  QX  (rather  than  SX  used  by  Story)  represents 
the  cell  number  (a  concatenation  of  current  state  and  input),  Q  represents  a  grouping 
of  cell  numbers  (two  cells  for  a  single  input  machine)  for  when  the  input  values  do 
not  matter,  and  d  (Story  uses  0.5)  indicates  don't-care-cells.  Formulas  for  RS  and  D 
flipflops  can  be  written  similarly: 


R~\  2“  R-l  2"  2" 


j=0  (=0  j=0  1=0  j=R 


R-\  2"  R-\  2"  2” 


y=0  1=0  y=0  1=0  j=R 


R-l  2"  2" 

y=0  i=0  J=R 


The  type  of  memory  element  chosen  for  a  circuit  can  also  afTect  the 
minimization  of  the  combinational  logic  part  of  the  circuit.  The  only  method  of 
determining  which  type  will  yield  minimal  resuKs  is  trial  and  error.  There  are 
however  only  a  limited  number  of  common  types  available.  The  decision  of  which 
type  to  use  is  left  to  the  user  of  the  program,  as  other  factors  than  just  minimization 
may  be  relevant.  No  provision  is  made  for  mixing  flipflop  types  in  a  single  machine 
circuit. 

4.  Logic  Equation  Production. 

It  is  desirable  that  the  logic  equations  describing  a  finite  state  machine  have 
both  a  minimal  number  of  gates  and  a  minimal  number  of  gate  levels.  Fewer  gates 
means  cheaper  production  costs  and  fewer  gate  levels  means  faster  operation. 
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Figure  4.  Karnaugh  Maps 


Toward  these  ends  the  prototype  program  produces  minimal  two-level  sum  of 
products  equations  using  only  NOT,  AND,  and  OR  operations. 

The  standard  procedure  for  producing  logic  equations  from  Karnaugh  maps  is 
the  Quine/McCluskey  (5,  7]  method  (figure  5b).  With  this  method  a  list  of  the 
on-cells  and  don't-care-cells,  called  implicants,  is  made.  They  are  grouped  according 
to  the  number  of  1  bits  in  their  binary  representations.  Each  implicant  in  a  group  is 
combined  with  as  many  implicants  in  the  following  group  (implicants  with  one  more 
1  bit)  as  possible,  forming  new  implicants  which  are  grouped  separately  according  to 
number  of  1  bits.  An  implicant  may  be  combined  with  another  if  their  binary 
representations  match  except  in  one  position  (e.g.  0010  and  0110).  The  bit  position 
in  which  the  two  differ  is  replaced  by  a  don't-care-symbol  (e.g.  0-10  or  0x10).  In 
combinations  involving  implicants  with  don't-care  positions,  the  don't-care  positions 
must  match  exactly  in  both  implicants.  The  implicants  which  combine  to  form  a  new 
implicant  are  marked  as  such.  When  no  new  implicants  can  be  formed,  an  implicant 
table  is  made  from  the  implicants  which  have  not  been  marked. 
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b)  Quine/McCluskey  method 
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c)  Prather  Method 
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d)  Final  Logic  Equation 
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Figure  5.  Logic  Equation  Production 
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The  implicants  become  the  rows  of  the  implicant  table  arranged  so  that  priority 
is  given  to  the  number  of  on-cells  covered  and  the  number  of  don't-care  positions 
(indicating  fewer  literals  and  thus  fewer  gate  inputs).  The  columns  of  the  implicant 
table  are  labeled  by  the  on-cells  of  the  map.  Entries  of  a  row  which  are  in  columns 
that  represent  on-cells  covered  by  the  implicant  are  marked. 

A  prime  implicant  is  one  which  alone  covers  an  on-cell  (is  the  only  implicant 
with  an  entry  in  that  column  marked).  Prime  implicants  are  essential  to  the  equation 
and  are  removed  from  the  table  along  with  the  columns  they  cover  and  become  the 
basis  of  the  equation.  All  remaining  columns  are  now  covered  by  two  or  more 
implicants.  Implicants  may  be  removed  from  the  table  without  affecting  it  if  they  are 
dominated.  One  implicant  dominates  another  if  for  every  column  covered  by  the 
second,  the  first  also  covers  that  column.  If  two  implicants  dominate  each  other  and 
one  has  fewer  don't-care  positions,  it  should  be  removed;  otherwise  the  decision  is 
arbitrary.  If  removing  dominance  from  the  table  leaves  columns  which  are  covered 
by  only  one  implicant,  those  implicants  should  be  selected-removed  from  the  table 
along  with  the  columns  they  cover  and  added  to  the  equation.  If  no  columns  are 
covered  by  only  one  remaining  implicant,  then  an  implicant  must  be  chosen  by  other 
means.  Normally  the  implicant  chosen  is  the  one  highest  in  the  table  covering  a 
column  having  the  least  number  of  implicants  covering  it.  The  process  of  removing 
dominance  and  chosing  implicants  is  repeated  until  the  implicant  table  is  empty.  The 
equation  is  now  complete.  While  the  equation  may  not  be  unique,  it  is  minimal. 

A  modification  of  the  Quine/McCluskey  method  was  given  by  Prather  [6] 

(figure  5c).  This  technique  identifies  essential  cells  (prime  implicants)  by  attempting 
to  complete  for  each  on  cell  the  ncell  indicated  by  adjacent  on-cells  and 
don't-care-cells.  If  this  ncell  can  be  completed,  it  is  essential  to  the  equation.  If  not, 
then  the  basic  (nonessential)  cells  which  cover  the  first  cell  can  be  found  by 
attempting  to  complete  the  ncell  without  one  or  more  of  the  original  adjacent  cells. 
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First  all  essential  ncells  are  found  and  the  cells  they  cover  marked.  Then  all  basic 
cells  are  found  for  those  on  cells  not  yet  covered  and  used  to  form  an  implicant  table 
which  is  reduced  as  above. 

An  ncell  is  completed  by  checking  to  see  if  all  the  necessary  cells  are  either  on  or 
don't-care.  The  on-cells  and  don't-care-cells  adjacent  to  the  cell  to  be  covered  are 
identified  first.  The  number  of  these  adjacent  cells  indicates  the  size  of  the  ncell  and, 
as  a  power  of  two,  the  number  of  individual  map  cells  covered  (e.g.  three  adjacent 
cells  indicate  a  3cell  covering  eight  cells,  zero  indicates  a  Ocell  covering  one  cell).  The 
next  group  of  cells  are  identified  by  adding  the  delta  of  each  original  adjacent  cell  to 
each  of  the  following  adjacent  cells.  New  groups  of  cells  are  found  until  one  contains 
only  a  single  cell,  at  which  time  the  ncell  is  complete,  or  until  an  indicated  cell  is 
neither  an  on-cell  nor  a  don't-care-cell.  If  the  ncell  cannot  be  completed,  an  attempt 
to  find  basic  cells  can  be  made  by  omitting  each  of  the  original  adjacent  cells,  one  at 
a  time,  whose  delta  was  involved  in  identifying  the  cell  which  failed  to  compete  the 
ncell.  The  Prather  method  was  the  method  chosen  for  the  prototype  program  as  it 
was  easier  to  represent. 
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C.  VHDL  DESCRIPTION  OF  A  FINITE  STATE  MACHINE 


The  prototype  translation  program  produces  two  files  as  output.  One  is  a  trace 
of  its  operation  including  the  initial  state  table,  simplified  state  tables,  Karnaugh  map 
representations,  essential  cells  and  implicant  tables  for  those  equations  with 
nonessential  cells,  complete  equations,  and  timing  of  operation.  The  other  file  is  the 
VHDL  source  code  of  the  finite  state  machine,  a  combination  of  structural  and 
behavioral  descriptions.  VHDL  version  7.2  was  used  for  this  file.  Syntax  was 
checked  for  correctness  with  the  VHDL  Analyzer.  Sample  output  for  these  files  can 
be  found  in  appendix  D. 

The  VHDL  code  description  of  the  state  machine  is  produced  with  the  use  of  a  I 

1 

template  file  (appendix  C)  containing  markers  indicating  the  information  needed. 

Markers  in  the  template  are  set  off  from  the  code  by  brackets.  When  in  copying  the 
VHDL  code  file  from  the  template  a  marker  is  found,  it  is  identified  and  replaced  by 
its  substitution  string.  Substitution  strings,  with  the  exception  of  the  actual  logic 
equations,  are  determined  from  parameters  prior  to  writing  the  VHDL  code  file.  The 
logic  equations  are  formulated  from  their  internal  representation  and  written  when 
the  logic  marker  is  found. 
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D.  CONCLUSIONS 


The  logic  equations  for  the  the  finite  state  machines  in  the  examples  shown  in 
appendix  D  were  checked  for  correctness  and  if  from  a  text,  compared  to  the  solution 
given  where  possible.  The  example  solutions  were  also  compared,  appendix  E,  with 
the  output  of  Meg  [9],  a  state  machine  equation  generator. 

The  VHDL  output  of  four  of  the  examples  in  appendix  D  (examples  1,  5,  6, 
and  7)  was  run  with  the  1076/B  VHDL  Simulator.  As  the  original  VHDL  code  was 
version  7.2,  some  minor  changes  were  required  to  make  the  machines  run.  They  did, 
however,  perform  as  expected. 

At  present  the  translation  program  is  not  software  interfaced  to  any  other 
programs  for  input  or  output.  The  VHDL  output  file  could  be  used  as  a  source  file 
for  simulation  or  simply  as  a  circuit  description. 

Following  are  some  possible  extensions  to  the  program.  A  graphical  interface 
for  input  would  be  useful.  The  handling  of  asynchronous,  multi-input/multi-output, 
and  incompletely  specified  state  machines  would  make  the  program  more  realistic  in 
terms  of  use.  Version  7.2  of  VHDL  was  used  for  the  prototype  program  as  that  was 
the  latest  version  of  the  analyser  available.  A  more  recent  version  would  be  desired 
for  actual  use.  Also,  standard  library  components  for  the  fiipflops  would  make  the 
designs  more  compatible  with  existing  systems  and  allow  greater  device  independence. 
The  examination  of  various  state  assignments  would  ensure  that  the  final  logic 
equations  were  indeed  the  minimal  possible.  Interfacing  the  VHDL  with  EDIF  [2] 
would  allow  graphical  representation  of  the  electrical  circuit. 
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NOMENCLATURE 


A  =  amplitude  (trough  to  peak)  of  versine  (1  -  cosine),  in. 

A^  =  coefficients  of  fifth-order  polynomial  defining 
force/displacement  characteristic 

2 

Aj^  =  effective  hydraulic  piston  area,  in^. 

2 

Aq  =  area  of  hydraulic  orifice,  in'^. 

.  2 

A--  =  effective  flow  area  through  hydraulic  orifice,,  in  . 

2 

Ap  =  effective  pneumatic  piston  area,  in"^. 

B  =  bulk  modulus  of  hydraulic  fluid,  psi 

Cjj  =  discharge  coefficient  for  hydraulic  orifice 

2  2 

=  quadratic  hydraulic  damping  coefficient,  Ib-s  /in  . 

=  effective  linear  viscous  damping  coefficient  of  tire, 
Ib-s/in. 

Djj  =  bottom  out  deflection  of  tire,  in. 

Dg  =  effective  diameter  of  chamber  that  will  expand  due  to 

internal  pressures,  in. 

s  deflection  of  tire  (positive  for  compression),  in. 

E  =  Young's  modulus  of  walls  of  chamber  that  will  expand  due 
to  high  internal  pressure,  psi 

Ff  =  sliding  friction  force  between  the  pneumatic  and 
hydraulic  chambers,  lb  — 

F  *  stiffness  force  between  the  pneumatic  and  hydraulic 

^  chambers  when  fully  extended,  lb 

Fh  *  hydraulic  damping  force,  lb 

Fj._  =  sum  of  hydraulic  damping  and  pneumatic  stiffness  forces, 
^  lb 

Fj^  =  length  of  flat  in  Class  B  bump,  in. 

Fp  =  pneumatic  stiffness  force,  lb 

F^  a  compressive  (positive)  force  in  tire,  lb 

F^  a  total  force  in  tire  due  to  tire  compression,  bottoming 

out,  and  damping,  lb 
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Kyj  =  effective  linear  stiffness  of  rim  when  tire  bottoms  out, 
Ib/in. 

L  =  aerodynamic  lift  on  nose,  lb 

Lg  =  effective  length  of  chamber  that  will  expand  due  to 

internal  pressures,  in. 


=  wavelength  of  versine,  in. 

9 

=  effective  mass  of  aircraft  nose,  Ib-s  /in. 

2 

Mp  =  effective  mass  of  floating  jump  piston,  Ib-s  /in. 

O 

=  effective  mass  of  wheel,  Ib-s  /in. 

P  =  pressure,  psia 

Pg  =  atmospheric  pressure,  psia 

Pj^  =  pressure  in  hydraulic  fluid,  psia 

Pj^  =  initial  pressure  in  pneumatic  chamber,  psia 

Pj  =  pressure  under  floating  Jump  piston,  psia 

Pp  =  pressure  in  pneumatic  chamber,  psia 

Qg  =  ef|ective  flow  rate  of  hydraulic  fluid  through  orifice, 

=  ideal  flow  rate  of  hydraulic  fluid  through  orifice,  in^/s 

=  height  of  up  and  down  ramps  of  Class  B  and  E  obstacles, 
in. 

=  length  of  up  and  down  ramps  of  Class  B  and  E  obstacles, 
“in.  ” 

S  =  displacement  between  pneumatic  and  hydraulic  chambers 

(stroke-positive  for  compression),  in. 

Sgmax  “  maximum  distance  that  floating  piston  can  travel  when 
^  system  is  in  static  equilibrium,  in. 

T  =  simulation  time,  s 

=  temperature  of  hydraulic  fluid,  F 

t^  =  effective  thickness  of  wall  of  chamber  that  will  expand 
due  to  internal  pressures,  in. 

Tq  =  simulation  time  at  which  tire  initially  contacts  the 
obstacle,  s 
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=  simulation  time  at  which  the  tire  has  completely 
traversed  the  obstacle,  s 

=  simulation  time  at  which  the  tire  leaves  the  up  ramp  of 
the  Class  B  or  E  obstacle,  s 

T2  =  simulation  time  at  which  the  tire  contacts  the  down  ramp 
of  the  Class  B  or  E  obstacle,  s 

V  =  volume,  in^. 

Vj,  =  change  in  hydraulic  fluid  volume  due  to  high  internal 
pressures ,  in^ . 

Vq  =  change  in  pneumatic  chamber  volume  due  to  high  internal 
pressures,  in'^. 

=  velocity  of  hydraulic  fluid  in  hydraulic  chamber  away 
from  orifice,  in/s,  or  volume  of  hydraulic  fluid  under 
14.7  psia,  in^. 

=  initial  volume  of  gas  in  pneumatic  chamber,  in^. 

Vp  =  volume  of  gas  in  pneumatic  chamber,  in^. 

Vj^  =  longitudinal  velocity  of  aircraft,  in/s 

Vq  =  velocity  of  hydraulic  fluid  through  orifice,  in/s 

Wj^  =  effective  weight  of  aircraft  nose,  lb 

Wp  =  effective  weight  of  floating  jump  piston,  lb 

=  effective  weight  of  wheel,  lb 

Yq  =  vertical  height  (displacement)  of  obstacle,  in. 

Yj^  =  displacement  of  aircraft  nose  with  respect  to  static 

equilibrium  position,  in. 

Yp  =  displacement  of  floating  jump  piston  with  respect  to 
static  equilibrium  position,  in. 

Y^  =  displacement  of  wheel  with  respect  to  static  equilibrium 
position,  in. 

P  =  compressibility  of  a  fluid,  In^/lb 

y  =  polytropic  gas  constant 

P  =  mass  density  of  hydraulic  fluid,  Ib-s^/in^. 

(T  =  weight  density  of  hydraulic  fluid,  Ib/in^. 
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INTRODUCTION 


A  primary  responsibility  of  the  Mechanical  Branch  of  the 
Vehicle  Equipment  Division  of  the  Flight  Dynamics  Laboratory  is 
to  assess  the  performance  of  landing  gears  during  taxi,  and 
landing  and  takeoff.  A  well-equipped  facility  is  available  for 
testing  individual  gears.  In  addition,  computer  simulations  of 
the  dynamics  of  the  complete  aircraft  are  available,  and  have 
been  used.  The  disadvantages  of  using  an  aircraft  simulation  to 
investigate  the  dynamics  of  landing  gears  are  two  fold.  First, 
aircraft  simulations  typically  simplify  the  landing  gear  such 
that,  although  the  dynamics  of  the  aircraft  are  reasonably 
simulated,  the  particulars  of  the  landing  gear  may  not  be  well 
represented.  This  would  be  especially  true  for  extreme  cases 
where  landing  gear  nonlinearities  are  Important. 

The  second  disadvantage  of  a  full  aircraft  simulation  is 
that  it  simulates  the  full  aircraft,  meaning  that  much  more  input 
information  is  needed  to  perform  the  simulation,  much  more 
-computer  time  is  required,  and  the  resnlts  of  the  full  aircraft 
simulation  often  do  not  apply  to  the  test  results  of  a  single 
gear. 

To  overcome  the  disadvantages  mentioned  above,  simulations 
were  developed  to  predict  the  performance  of  a  single  landing 
gear.  The  first  simulation  predicts  the  pressures  and  forces  in 
the  gear  when  it  is  stroked  by  a  well-defined  displacement  input. 
It  includes  several  nonlinear  phenomena,  but  no  dynamics.  The 
second  simulation  is  that  of  a  cantilevered  nose  gear  traversing 
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a  well-defined  obstacle,  typical  of  what  would  be  expected  on  a 
bomb-damaged  repaired  (BDR)  runway.  It  includes  several 
nonlinear  phenomena,  and  the  dynamics  of  the  sprung  and  unsprung 
masses  associated  with  a  landing  gear.  While  the  simulation  was 
developed  to  be  generic,  it  has  some  specific  capabilities 
associated  with  the  F-16  nose  gear,  especially  the  Advanced 
Landing  Gear  System  (ALGS)  that  includes  the  jump  capability.  If 
the  jump  capability  is  desired,  a  third  degree  of  freedom  is 
simulated,  that  of  the  floating  jump  piston.  The  floating  jump 
piston  quickly  forces  the  hydraulic  fluid  into  the  pneumatic 
chamber,  causing  the  gear  to  lift  off  of  its  support. 

Testing  on  the  standand  F-16  gear  traversing  an  obstacle,  or 
on  the  ALGS  during  the  jump  phenomenon  has  been  accomplished  by 
the  Mechanical  Branch  and  results  can  be  compared  to  the 
simulations  developed  by  this  project.  The  computer  simulations 
can  then  be  used  to  investigate  the  effect  of  varying  hardware 
parameters  before  actual  tests  are  conducted.  Ideally,  tests  can 
be  used  to  verify  a  small  number  of  simulations,  and  then  the 
simulations  can  be  used  to  predict  a  wide  variety  of  conditions. 


ANALYTICAL  MODELS 

There  are  generally  three  types  of  forces  that  act 
internally  to  a  strut:  pneumatic  stiffness,  hydraulic  damping, 
and  bearing  friction.  In  addition,  on  the  ALGS  the  jump 

X 

capability  requires  the  simulation  of  a  pressure  buildup  that 


accelerates  a  floating  piston  in  the  hydraulic  chamber,  causing 


the  sprung  mass  (nose  weight)  to  accelerate  upward,  carrying  the 
unsprung  mass  (wheel)  with  it.  External  forces  acting  on  the 
system  include  weight;  lift;  the  force  in  the  tire  due  to  the 
system's  static  weight  and  lift,  traversing  an  obstacle,  or  being 
compressed  because  of  the  pressure  buildup  during  the  jump;  and 
atmospheric  pressure. 

The  pneumatic  stiffness  is  generated  by  compressing  a  gas, 
usually  air  or  nitrogen.  Because  of  the  high  pressures  sometimes 
generated  in  a  landing  gear,  the  simulations  also  include  the 
effect  of  compressing  the  hydraulic  fluid  and  expanding  the 
chamber.  Both  of  these  effects  are  nonlinear,  decrease  the 
pneumatic  pressure,  and  are  relatively  unimportant  at  normal 
pressures  in  the  gear.  But  they  are  significant  if  internal 
pressures  reach  several  thousand  psi. 

The  hydraulic  damping  is  a  result  of  forcing  fluid  through 
one  or  more  orifices.  Although  the  hydraulic  force  is 
proportional  to  the  square  of  the  relative  velocity  between  the 
pneumatic  and  hydraulic  chambers,  that  proportionality  can  be  a 
nonlinear  function  itself. 

The  bearing  friction  is  produced  at  the  seals  between  the 
pneumatic  and  hydraulic  chambers.  Associated  with  the  friction 
force  is  a  stiction  force  that  must  be  overcome  before  sliding 
motion  can  take  place  (static  friction). 

A  fourth  internal  force  present  in  the  ALGS  is  a  pressure 
buildup  under  a  floating  piston  in  the  hydraulic  chamber  that 
quickly  forces  all  the  hydraulic  fluid  into  the  pneumatic 
chamber,  causing  rapid  separation  of  the  two  chambers  and  liftoff 
of  the  gear. 
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A  schematic  of  the  F-16  landing  gear  is  shown  in  Figure  1 
with  appropriate  forces  and  pressures  indicated.  In  the  non jump 
mode,  the  floating  piston  rests  at  the  bottom  of  the  hydraulic 
chamber  and  has  no  effect  on  the  dynamics  of  the  gear.  This  is 
the  mode  of  operation  when  simulating  the  motion  of  the  gear 
traversing  an  obstacle.  On  the  other  hand,  if  the  jump  operation 
is  being  simulated,  the  floating  piston  is  accelerated  toward  the 
base  of  the  pneumatic  chamber  when  the  jump  pressure  is  greater 
than  the  hydraulic  pressure.  The  upward  motion  of  the  floating 
piston  increases  the  hydraulic  pressure,  increasing  the  flow  of 
oil  through  the  orifices,  and  causing  the  pneumatic  chamber  to 
extend  relative  to  the  hydraulic  chamber.  If  the  jump  pressure 
is  high  enough  and  is  maintained  long  enough,  the  tire  will  be 
compressed  greatly;  and  when  it  rebounds,  the  entire  gear  may 
lift  off  its  support. 

A  summary  of  the  nonlinear  phenomena  that  have  been  included 
in  this  analysis  is  given  in  Table  1. 


Table  1.  Nonlinear  capabilities  available 

1.  Polytropic  gas  in  pneumatic  chamber. 

2.  Variable  polytropic  gas  constant. 

3.  Hydraulic  damping  proportional  to  velocity  squared. 

4.  Hydraulic  discharge  coefficient  varies  with  metering  pin 
location. 

5.  Hydraulic  discharge  coefficient  varies  with  compression 


or  extension. 

6. 

Hydraulic  fluid 
temperature. 

density 

varies 

with 

pressure 

and 

7. 

Hydraulic  bulk 
temperature. 

modulus 

varies 

with 

pressure 

and 

8.  Hydraulic  fluid  is  compressible. 

9.  Gear  chamber  is  expandable. 

10.  Friction  and  stiction  forces  differ  in  compression  and 
extension. 

11.  Tire  can  traverse  Class  B  and  Class  E  obstacles. 

12.  Tire  force/displacement  relation  is  fifth  order. 

13.  Tire  bottoming-out  force  is  included. 
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Figure  1.  Schematic  of  F-16  ALGS. 
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14.  Tire  lift-off  is  possible. 

15.  Metal  on  metal  stiffness  force  is  generated  when  the 
chambers'  relative  position  is  fully  extended. 

16.  Dynamics  of  floating  jump  piston  depends  on  dynamics  of 
hydraulic  chamber,  the  hydraulic  and  jump  pressures,  or 
the  dynamics  of  the  pneumatic  chamber. 

Because  of  the  way  the  computer  code  is  written,  it  would  be 
relatively  easy  to  incorporate  other  nonlinearities  into  the 
program.  For  example,  the  program  uses  a  constant  hydraulic 
temperature  to  determine  fluid  density  and  bulk  modulus.  If 
desired,  a  variable  temperature  could  be  used  to  affect  a  change 
in  density  and  modulus  while  the  simulation  is  running. 

Equations  of  Motion 

The  dynamic  equations  of  motion  of  the  system  are  associated 
with  the  mass  of  the  wheel /tire/hydraulic  chamber,  the  effective 
mass  of  the  nose  of  the  aircraft/pneumatic  chamber,  and  the  mass 
of  the  floating  piston.  The  equation  of  motion  of  the  wheel  is: 

^w  *  ^w  ~  ^t  “  ^h  *  ^h  “  '^p  *  ^^p  ”  ^h^  ^a  *  ^p  “  ^f  ^ge  “  '^w 

(1) 

In  the  jump  mode,  if  Pj  >  Pj^,  the  floating  jump  piston  will 
lift  off  its  support  in  the  hydraulic  chamber  and  move  toward  the 
base  of  the  pneumatic  chamber.  For  this  case.  Equation  (1)  is 
revised  by  substituting  Pj  for  Pj^. 

In  Equation  ( 1 ) ,  the  friction  force  acts  in  the  direction 
opposing  the  relative  motion  between  the  pneumatic  and  hydraulic 
chambers  and  is  typically  a  different  value  for  compression  than 
for  extension.  In  addition,  the  friction  force  is  typically  high 
before  motion  starts  and  reduces  to  a  lower  value  once  the 
stiction  force  has  been  overcome.  The  second  last  term  on  the 
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right  hand  side  of  Equation  (1)  is  a  stiffness  force  that  exists 
when  the  pneumatic  chamber  has  fully  extended  itself  with  respect 
to  the  hydraulic  chamber.  From  a  practical  standpoint,  the  two 
chambers  cannot  separate  indefinitely.  When  the  maximum  gear 
extension  is  achieved,  a  very  stiff  linear  spring  is  engaged,  and 
the  upper  chamber  will  attempt  to  pull  the  lower  chamber  off  of 
its  support  (wheel  lift). 

The  equation  of  motion  for  the  upper  mass  (aircraft  nose)  is 
given  by: 

-  Pp  Ah  -  Pa  Ap  +  ''f  -  Pge  ’  «n  (2) 

In  the  jump  mode,  if  the  floating  piston  contacts  the  base 
of  the  pneumatic  chamber,  the  hydraulic  force  becomes  zero,  and 
the  jump  pressure  acts  directly  on  the  pneumatic  chamber.  For 
this  case.  Equation  (2)  is  revised  by  substituting  Pj  for  Pj^. 

The  equation  of  motion  for  the  floating  piston  is  given  by: 


Y  =  Y 
^p  *w 


if  Pj  <  Ph 


Mp  X  Yp  =  (Pj  -  P^)  X  -  Wp  if  Pj  >  Pj^ 


Yp  -  Y^  <  Sp^3^ 


(3a) 

(3b) 


Y 


P 


=  Y 


n 


if 


- 


Yw  > 


'pmax 


(3c) 


The  dynamics  of  the  floating  piston  is  equivalent  to  that  of 
the  wheel  until  the  jump  pressure  is  high  enough  to  overcome  the 
hydraulic  pressure.  As  the  floating  piston  moves  up,  it  forces 
the  hydraulic  fluid  through  the  orifice  in  the  base  of  the 
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pneumatic  chamber.  When  the  floating  piston  makes  contact  with 
the  base  of  the  pneumatic  chamber,  its  dynamics  is  assumed  to  be 
that  of  the  aircraft's  nose.  The  damping  force  due  to  hydraulic 
fluid  flow  through  the  orifice  is  also  set  equal  to  zero  at  this 


time. 


The  pressure  terms  in  Equations  (1)  and  (2)  can  be 


rearranged  to  give  the  following  forces: 


■'hp  '  (Ph  -  Pp)  *  Ah  +  (Pp  -  Pa)  Ap  (4) 

The  first  term  in  Equation  (4)  is  referred  to  as  the 
hydraulic  damping,  while  the  second  term  is  called  the  pneumatic 
stiffness.  is  replaced  by  Pj  during  some  phases  of  the  jump 
mode  as  explained  above  for  Equations  (1)  and  (2). 


Hydraulic 


Force 


For  an  incompressible  fluid,  Bernoulli's  equation  relates 
fluid  pressure  and  velocity  at  one  point  in  a  flow  to  any  other 
point  in  the  flow.  For  flow  through  the  orifice: 

Ph  -  Pp  =  (^o  -  vg)  " 

2 

For  incompressible  flow,  the  volume  flow  rate  is  constant. 
Therefore, 


^h  *  ^h  =  ^o  *  ^o 


Substituting  from  Equation  (6)  into  Equation  (3)  gives: 


Ph  -  pp  =  -  -h  - 
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Since  the  hydraulic  area  is  much  greater  than  the  orifice 


area.  Equation  (7)  reduces  to: 


Ph  -  Pp  =  X  vg  X 
2 


'o/ 


(8) 


If  the  flow  direction  changes,  the  direction  of  the 
hydraulic  damping  force  also  changes,  which  can  be  handled  as: 

’^h  - 


fh  =  (Ph  -  Pp)  ^  Aj,  =  /? 


Vh  ^ 

2  X 


(9) 


Equation  (9)  is  the  hydraulic  force  for  ideal  flow.  But 
there  are  losses  depending  on  the  orifice  geometry  and  other 
factors,  which  decrease  the  flow  rate.  The  ratio  of  the 
effective  flow  rate  to  the  ideal  flow  rate  is  determined 
experimentally  and  is  called  the  discharge  coefficient. 


Cd  = 


Qi 


Vo  ^  Aq 


oe 


(10) 


Typically,  the  hydraulic  force  is  modified  by  as  follows: 


= 


Vh  *  A^ 


(11) 


^h  '  2 

2  X  (Cjj  X  Aq)"^ 

The  expression  for  the  hydraulic  force  can  also  be  written 


as : 


=  Ck  X  Vv 


Vv 


(12) 


where:  =  a  function  of  stroke  if  one  of  several 

devices  is  used  to  vary  the  orifice  size 
or  flow  characteristic.  typically 
varies  as  a  function  of  stroke,  due  to  a 
metering  pin  and/or  the  opening  of  extra 
orifice  holes  in  extension. 


64-12 


Pneumatic  Stiffness  Force 

The  compression  of  the  gas  in  the  pneumatic  chamber  can  be 


represented  by  a  polytropic  process  where: 


(13) 


are: 


P  X  V  =  constant 

Ideal  values  for  the  polytropic  constant, 

•  Isothermal:  'i  =  1.0 

•Adiabatic:  y  =  1.4  (for  air  or  nitrogen) 

In  actual  practice,  depends  on  the  characteristics  of 

the  stroke,  e.g.  the  amount  of  oil  spraying  into  the  gas.  From 
Equation  (13)  the  pneumatic  pressure  during  a  stroke  is  given  by: 


=  P. 


V. 


V, 


(14) 


The  volume  of  gas  in  the  pneumatic  chamber  during  a  stroke 
is  given  by: 

'^p  =  ''i  '  Ap  X  S  (15) 

In  Equation  (15),  S  is  modified  if  the  floating  jump  piston 
causes  some  or  all  of  the  hydraulic  fluid  to  move  into  the 
pneumatic  chamber.  Substituting  Equation  (15)  into  Equation 
(14),  the  pneumatic  force  can  be  writteiT  as: 


Fp  =  CPp  -  Pa^  ^  Ap  = 


’„  /  ''i  ' 

Pi  X  - 

-  Pa 

\Vl  -  Ap  X  Sj 

X  A, 


(16) 


Chamber  Expansion  -  If  high  pressures  are  developed  in  a  thin- 
walled  chamber,  the  expansion  of  the  chamber  may  be  significant. 
The  change  in  volume  in  a  cylindrical  chamber  is  given  by: 


Ve  = 


TT  X  Pp  X  X  Le 
4  X  t^  X  E 


(17) 
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Hydraulic  Compressibility  -  At  high  pressures  a  hydraulic  fluid 
becomes  somewhat  compressible,  such  that  the  effective  pneumatic 
volume  will  be  increased.  The  compressibility,  which  is  the 
inverse  of  the  bulk  modulus,  is  defined  as  the  negative  of  the 
fractional  change  in  volume  per  unit  change  in  pressure,  or 


b  =  —  - - 

P  xAP 


(18) 


The  effective  decrease  in  hydraulic  volume  is  then  given  by; 


Vc  =  - 


Vh  ^  (Ph  -  Pa) 


(19) 


Corrected  Pneumatic  Force  -  If  both  chamber  expansion  and 
hydraulic  compressibility  effects  are  included  in  the  pneumatic 


stiffness  force.  Equation  (16)  is  modified  to: 


%  = 


V. 


-  P, 


Vi  -  Ap  X  S  +  V3  - 


X  A- 


(20) 


Since  both  the  expansion  of  the  pneumatic  chamber  and  the 
compressibility  of  the  hydraulic  fluid  depend  on  the  pneumatic 
pressure,  an  iteration  procedure  is  necessary  to  determine  the 
pneumatic  stiffness  force  in  Equation  (20). 


Friction/Stictlon 

To  analytically  define  the  friction  and  stiction  forces  in  a 
gear  can  be  an  extremely  complex  undertaking.  The  present 
simulation  provides  for  the  specification  of  different  friction 
and/or  stiction  forces  during  compression  or  extension.  If  a 
more  exact  formulation  is  required,  it  can  be  written  in  the 
friction  subroutine. 
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Hydraulic  Fluid  Density 

Typically,  the  density  of  the  hydraulic  fluid  is  taken  as  a 
constant.  Actually,  over  a  temperature  range  of  -100  F  to  500  F 
and  a  pressure  range  of  0  psi  to  5000  psi,  the  density  can  vary 
from  .027  to  .034  Ib/in^.  This  variation  is  represented  by; 

O'  =  0.03242  -  1.0575  x  10"^  x  T^ 

+  [8.6  X  10"®  +  3.3  X  10"^°  Tj^jx  (21) 

The  fluid  density  affects  the  hydraulic  damping  force;  see 
Equation  (11). 


Hydraulic  Fluid  Compressibility 

The  bulk  modulus  of  a  hydraulic  fluid  varies  by  an  order  of 
magnitude  over  a  temperature  range  of  -100  F  to  500  F,  and  a 
pressure  range  of  0  to  5000  psi  (50,000  to  500,000  psi).  This 
variation  for  hydraulic  fluid  MIL-H-5606B  is  represented  by; 


=  331,000  X  x 


1  - 


Ph  -  2000 


3000 

+  372,000  X  e”°*°0276T^  ^ 


Pj^  -  2000 


(22) 


3000 

The  bulk  modulus  affects  the  hydraulic  compressibility 
volume.  Equation  (19). 


Tire  Force 

For  small  tire  compression,  the  force/displacement 
characteristic  can  be  taken  as  linear.  However,  large  tire 
displacements  are  likely  for  both  motion  over  obstacles  and 
during  the  jump  mode.  Therefore,  the  characteristic  of  the  tire 
can  be  represented  by  a  fifth  order  relation  if  desired.  The 
equation  is; 
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(23) 


n= 


il  X  + 


^2  X  +  A 


3  X  D2  +  X  D 


t  *  ''5 


>? 


If  the  tire  bottoms  out,  the  stiffness  force  is  assumed  to 
be  the  series  sum  of  the  tire  force  at  its  bottomed-out 
displacement  plus  the  stiffness  force  caused  by  deflecting  the 
tire's  rim.  Also,  a  linear  damping  force  is  produced  by  a 
damping  coefficient  times  the  relative  velocity  between  the  wheel 
and  its  support  (runway). 


(Dij)  +  Kt,  X  (D^  -  D^)  +  X  (24) 

If  a  linear  tire  force/displacement  relation  is  desired,  use 
only  the  Aj^  coefficient  in  Equation  (23). 


Runway  Obstacles 

Three  types  of  obstacles  have  been  identified  as 
representative  of  a  BDR  runway.  A  no  obstacle  case  is  also 
available  to  be  used  when  simulating  the  jump  gear.  The  type  of 
obstacles  are  listed  below  and  shown  in  Figure  2. 

*  No  obstacle 

*  Multiple  Versine  (1  -  Cosine) 

*  Class  B  Bump 

*  Class  E  Bump 

The  vertical  displacement  and  velocity  of  the  multiple 
versine  obstacle  are  given  by: 


2  X  77*  X  Vi 


cos 


X  (T  -  T-) 


'w 


(25) 


Yq  = 


A  X  7r  X 


X  sin 


"w 


2  X  Tt  X  Vi 


X  (T  -  T-) 


(26) 
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c.  Class  E  Bump 


Figure  2.  Obstacle  profiles  for  BDR  runways. 
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The  vertical  displacement  and  velocity  of  a  Class  B  obstacle 


Yo  =  - 


AxrxVj^  /2x7rxVi 

X  sin|  -  X  (T  -  Tj^) 


'w 


"w 


(36) 


For  T2  <  T  <  T^: 
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«h  - 


Rh  ^  Vj 

Ri 


X  (T  -  T,) 


=  - 


Rh  X  Vi 


(37) 

(38) 


Numerical  Integration  Methods 

The  equations  of  motion  were  reduced  to  first  order 
differential  equations,  and  can  be  numerically  integrated  using 
one  of  the  following  four  methods: 

*  Runge-Kutta-Fehlberg 

*  Runge-Kutta  (fourth-order) 

*  Adams-Bashforth- Moulton 

*  Euler 

The  first  three  methods  incorporate  an  option  to  approximate 
the  error  and  Increase  or  decrease  the  time  increment. 
Theoretically,  this  technique  can  be  used  to  optimize  the 
accuracy  and  running  time.  The  first  three  methods  can  also  be 
run  at  a  fixed  time  increment  by  setting  an  input  parameter. 
Note,  automatic  time  step  variation  has  not  worked  with  this 
simulation,  possibly  because  of  the  nonlinearities. 


64- 1 9 


RESULTS 


Gear  Stroke  Simulation 

The  Gear  Stroke  Simulation  can  be  used  to  investigate  the 
pressures  and  forces  in  the  pneumatic  and  hydraulic  chambers  when 
subjected  to  relative  displacements  that  represent  the  motion  of 
a  gear  over  a  BDR  obstacle.  By  varying  the  amplitude,  length,  or 
wavelength  of  the  various  obstacle  profiles  provided,  different 
relative  displacements  and  velocities  between  the  pneumatic  and 
hydraulic  chambers  can  be  achieved.  As  a  result,  different 
pneumatic  and  hydraulic  forces  can  be  produced. 

The  program  can  also  be  used  to  investigate  the  effects  of 
chamber  expansion,  hydraulic  compressibility,  changes  in 
hydraulic  fluid  density,  changes  in  the  polytropic  constant, 
various  orifice  configurations,  and  the  basic  geometry  of  a  gear. 
In  addition,  the  program  was  written  and  annotated  in  such  a  way 
that  capability  or  variability  can  be  easily  added  by  the  user. 
The  input  data  are  described  in  APPENDIX  A;  a  set  of  input  data, 
as  it  should  appear  in  the  data  file  called  GEAR.DAT,  is  given  in 
APPENDIX  B;  the  output  data  are  described  in  APPENDIX  C;  a  sample 
output  run,  using  the  data  shown  in  APPENDIX  B,  is  given  in 
APPENDIX  D;  and  a  FORTRAN  listing  of  the  program  is  given  in 
APPENDIX  E. 

The  time  history  of  the  pneumatic  and  hydraulic  forces  are 
plotted  in  Figure  3  for  the  input  data  given  in  APPENDIX  B.  The 
simulated  obstacle  was  a  versine  of  amplitude  10.32  in.  (maximum 
gear  stroke)  and  a  wavelength  of  600  in.  The  simulated  velocity 
was  50  knots-  The  pneumatic  force  increases  monatonically  until 


Time .seconds 


Figure  3.  Time  histories  of  pneumatic  and  hydraulic  forces  for  F-16 
nose  gear  during  a  10.32  in.  versine  stroke. 


the  stroke  reaches  its  maximum  on  the  F-16  gear.  It  should  be 
noted  that  small  changes  in  the  polytropic  constant  have  a 
significant  effect  on  the  maximum  pneumatic  force.  The  hydraulic 
force  resembles  a  distorted  sine  function.  The  distortion  is 
caused  by  the  fact  that  the  effective  quadratic  damping 
coefficient  is  a  function  of  the  stroke.  There  are  times  at 
which  the  stroke  velocity  is  increasing,  but  the  damping 
coefficient  is  decreasing  causing  a  leveling  out  in  the  hydraulic 
force  (see  Appendix  D  at  times  0.06  and  0.08  s). 

F-16  Nose  Gear  Simulation 

The  second  computer  program  contains  the  features  of  the 
first  program  plus  much  more,  including:  the  dynamics  of  the 
wheel  and  the  simulated  nose  mass,  the  tire  flexibility,  and  the 
jump  capability.  The  program  runs  basically  in  one  of  two  modes: 
1)  dynamics  of  the  gear  traversing  an  obstacle,  and  2)  dynamics 
of  the  gear  during  the  jump.  Both  simulations  provide  the  time 
history  of  the  dynamics,  pressures,  and  forces  in  the  system.  As 
with  the  first  program,  this  simulation  was  written  and  annotated 
so  that  its  capabilities  could  be  expanded.  The  input  data  are 
described  in  APPENDIX  F;  sets  of  input,  for  both  modes  of 
operation,  are  given  in  APPENDIX  G;  the  output  data  are  described 
in  APPENDIX  H;  a  sample  output  run  for  each  mode  of  input  data, 
shown  in  APPENDIX  G,  is  given  in  APPENDIX  I;  and  a  FORTRAN 
listing  of  the  program  is  given  in  APPENDIX  J. 

Obstacle  Simulation  -  A  sample  simulation  of  the  nose  gear 
traversing  five  verslne  waveforms  was  performed.  J*  plot  of  the 
time  histories  of  the  stroke,  pneumatic  pressure,  hydraulic 
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pressure,  tire  load,  and  obstacle  profile  are  shown  in  Figure  4, 
along  with  a  comparison  to  test  data.  Of  the  four  comparisons, 
only  tire  load  is  well-correlated.  The  stroke  comparison.  Figure 
4a,  starts  out  well,  but  then  the  simulation  oscillates  about  a 
mean  value  that  indicates  that  the  pneumatic  and  hydraulic 
chambers  have  separated  somewhat  beyond  their  static  equilibrium 
position.  The  final  stroke  value  is  negative  because  the  large 
stiction  force  cannot  be  overcome. 

The  correlation  between  test  and  simulation  of  the  pneumatic 
pressure.  Figure  4b,  is  also  poor  because  of  the  low  amplitude 
changes  in  the  simulation.  However,  a  simple  calculation  with 
the  polytropic  gas  equation  (Equation  14)  indicates  that  a  1.0 
in.  stroke  at  0.15  seconds  should  correspond  to  an  increase  in 
pneumatic  pressure  from  937  to  1060  psi,  as  the  simulation 
results  indicates,  not  1600  psi  as  the  test  data  indicate.  The 
lower  final  pneumatic  pressure  in  the  simulation  can  be 
attributed  to  the  high  stiction  force. 

In  Figure  4c,  the  hydraulic  pressure  simulation  correlates 
somewhat  better  with  the  test  data.  It  should  be  noted  that  the 
test  data  for  the  pneumatic  and  hydraulic  data  are  almost 
identical.  This  does  not  seem  reasonable,  because  there  would  be 
no  flow  of  the  hydraulic  fluid,  and  therefore,  a  constant  stroke. 

In  Figure  4d,  the  simulated  tire  load  matches  very  well  with 
the  test  data,  until  the  fifth  versine  wavelength  is  negotiated. 
For  the  last  half  of  the  simulation  the  magnitude  is  similar,  but 
it  is  out  of  phase  with  the  test  data.  The  effective  frequency 
of  oscillation  of  the  simulated  gear  system  was  higher  than  the 
test  data  indicated.  This  suggests  a  difference  in  the 
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Obstacle  Tire  Hydraulic  Pneumatic 

Height, in.  Load.lbxlO  Pressure, ksi  ,  Pressure, ksi  Stroke, in 


stiffness,  damping  or  mass  properties  assumed  for  the  simulation, 
and  those  actually  used  in  the  test. 

Jump  Gear  Simulation  -  Sample  data  of  the  simulation  of  the  jump 
phenomenon  is  compared  to  test  data  in  Figures  5  and  6.  In 
Figure  5a,  the  simulated  stroke  correlated  well  with  test  data, 
except  that  the  time  to  full  extension  was  10  percent  quicker. 
The  simulated  hydraulic  pressure  difference  across  the  orifice 
(Figure  5b)  compared  well  to  test  data  in  magnitude,  but  the 
predicted  time  for  the  floating  jump  piston  to  push  all  the 
hydraulic  fluid  into  the  pneumatic  chamber  was  about  25  percent 
shorter  than  actual.  The  simulated  jump  pressure  (Figure  5c)  is 
input  data. 

In  Figure  6a,  the  simulated  nose  velocity  increased  somewhat 
quicker,  and  did  not  fall  off  as  quickly  as  the  test  results. 
Both  of  these  events  can  be  better  interpreted  by  considering 
Figure  6b  and  6c.  The  tire  load  (Figure  6b)  displays  a  large 
force  at  0.028  s.  This  large  force  was  caused  by  the  tire 
bottoming  out,  which  apparently  did  not  occur  during  the  test. 
The  large  tire  force  caused  a  more  severe  rebound,  and  therefore, 
accelerated  the  nose  at  a  higher  rate.  The  simulated  tire  load 
remained  consistently  above  the  test  results  up  to  0.22  s,  where 
it  fell  off  rapidly  until  wheel  lift  was  achieved  at  0.25  s, 
again  somewhat  sooner  than  the  test  results.  In  Figure  6c,  the 
simulated  nose  acceleration  correlates  reasonably  well  with  the 
high  side  of  the  test  results.  At  0.3  seconds  the  acceleration 
test  results  makes  an  abrupt  increase.  Since  the  system  is  in 
free  flight  at  this  time,  there  is  no  physical  justification  for 
this  increase.  In  addition,  simple  calculations  of  the  system  in 
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c.  Nose  weight  acceleration 
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Figure  6.  Comparison  of  test  and  simulation  time  histories  associated 
with  F-16  ALGS  nose  gear  during  a  jump 


free  flight  indicate  that  the  nose  acceleration,  with  only  weight 
and  lift  acting,  should  be  about  -0.4  g's,  as  the  test  results 


displays  between  0.27  and  0.30  seconds.  It  is  at  0.3  seconds 
that  the  test  nose  velocity  (Figure  6a)  falls  off  much  quicker 
than  the  simulation  results,  indicative  of  the  higher  negative 
nose  acceleration  shown  in  Figure  6c. 

CONCLUSIONS 


Large  computer  programs  that  simulate  the  dynamics  of  a 
complete  aircraft  are  often  used  to  investigate  or  design  a 
landing  gear.  While  this  approach  is  meaningful,  it  probably 
should  not  be  used  during  preliminary  design.  Typically,  an 
aircraft  simulation  does  not  include  many  of  the  details  of  the 
gear;  it  is  significantly  more  difficult  to  use  because  inputs, 
not  associated  with  the  gear,  must  be  determined;  and  the  output 
is  more  difficult  to  appreciate  because  much  of  it  is  associated 
with  the  aircraft  rather  than  the  gear.  For  preliminary  design, 
a  detailed  model  of  only  the  gear  will  produce  greater  insight 
into  its  performance  than  a  complete  aircraft  model.  Also,  a 
single  gear  simulation  can  be  used  to  compare  to  laboratory  test 
data. 

With  these  incentives  in  mind,  two  computer  models  were 
developed.  While  many  of  the  features  of  both  models  are  generic 
to  landing  gears,  several  of  the  features  are  particular  to  the 
F-16  nose  gear  and  the  ALGS  developed  for  the  nose  gear  of  the  F- 
16.  The  first  model  is  essentially  kinematic  in  nature,  i.e., 
the  pneumatic  chamber  is  fixed,  and  the  hydraulic  chamber  is 
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stroked  according  to  a  well-defined  simulated  BDR  obstacle.  The 
program  can  be  used  to  investigate  basic  geometries,  varying 
orifice  areas,  and  several  nonlinearities  associated  with  gears. 
Results  of  the  program  can  be  compared  to  test  data  generated  on 
a  machine  that  strokes  the  gear. 

The  second  computer  program  simulates  the  dynamics  of  a 
wheel,  gear,  and  effective  spring  mass.  The  output  is  produced 
by  numerically  integrating  the  differential  equations  of  motion. 
The  dynamic  model  contains  all  of  the  features  modeled  in  the 
first  model,  plus  several  additional  nonlinearities  summarized  in 
Table  1.  The  model  simulates  a  wheel/gear  traversing  an  obstacle 
or  the  jump  phenomenon  associated  with  the  ALGS. 

The  results  of  the  first  model  was  shown  to  produce  a 
realistic  pneumatic  pressure  versus  stroke  characteristic.  Figure 
3.  Experience  with  the  model  quickly  gives  the  user  an 
appreciation  that  total  forces  are  sensitive  to  several  system 
parameters  which  are  difficult  to  determine,  e.g.,  the  polytropic 
gas  constant. 

The  comparison  of  the  dynamic  model's  results  to  actual  test 
data  were  mixed.  In  several  cases  the  correlation  was  good;  at 
other  times  the  correlation  was  poor.  However,  as  was  pointed 
out  in  the  results,  the  test  results  seemed  to  be  inaccurate  in  a 
couple  of  places.  The  simulation  results  were  good  enough, 
however,  to  encourage  further  use  of  the  model,  and  to  attempt  to 
determine  exactly  what  caused  the  difference  between  the 
simulation  and  test  results. 
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RECOMMENDATIONS 


The  two  computer  simulations  developed  on  this  project 
seemed  to  give  reasonable  results,  and  in  several  cases 
correlated  very  well  with  test  data.  However,  there  still  exists 
a  need  to  determine  the  cause  of  the  discrepancies  between  the 
test  and  simulation  results.  Likely  causes  are  a 
misunderstanding  of  the  exact  nature  of  the  test  conditions,  and 
small  errors  in  the  analytical  model  or  computer  code. 

Further  efforts  should  be  made  to  validate  the  computer 
codes  with  well-defined  test  cases.  Working  with  the  codes  in 
conjunction  with  a  test  program  would  provide  needed  familiarity 
and  give  the  user  a  chance  to  try  various  options,  and  possibly 
add  some  additional  capabilities. 
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PREFACE 


A  summary  of  the  work  completed  for  the  minicrant  from  the  Air 
Force  Office  of  Scientific  Research,  Cntract  No.  F 4 96 20-85-C-0013/ 
SE5851-0360,  during  the  period  April  1,  1987  to  March  31,  1988.  The 
primary  aim  of  the  project  has  been  to  evaluate  actively  cooled 
structures,  i.e.  convection  heat  transfer  through  complex  passages. 
Two  geometries  have  been  studied  quite  extensively.  They  are  pin 
fin  and  rectangular  plate  fin  geometries.  In  addition  the 
mathematical  formulation  for  finite  element  implementation  of  the 
k- ^ turbulence  model  is  in  place. 
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x,y  =  coordinate  directions 
u  =  velocity  in  direction  x 

V  =  velocity  in  direction  y 

T  «  temperature 

p  *  pressure 

5  =  density 


=  Average  velocity  at  minimum  cross 
section  of  cylinders 


=  Drag  Coefficient 
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Re  =  Reynolds  number  = 


s  =  arc  Length  (measured  along  the 
circumference  of  each  cylinder) 

=  shape  functions  for  velocity, 
(temperature)  and  pressure. 

-■  penalty  parameter 
%•  =  vector  of  unitnowns 
Pe  =  Peclet  number  - 

-  domain  and  boundary  of  a 
finite  element 

4  =  turbulence  Icinetic  energy 

€  =  dissipation  rate  of  turbulence 

kinetic  energy 

C_  t  -  constants  for  the  k-^  model 
L,H  =  parameters  for  rectangular  fins 
6^0,5  =  parameters  for  diamond  shape  fins 
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A  study  of  the  performance  of  active  cooling  devices  requires 
the  solution  of  Navier  Stokes  equations  including  the  effects  of 
turbulence.  Since  the  equations  are  coupled  and  nonlinear  exact 
solutions  have  been  possible  only  in  a  few  simple  cases.  For  prob¬ 
lems  of  practical  interest,  only  numerical  solutions  are  possible. 
For  active  cooling  devices,  because  of  the  complex  geometries, 
separation,  reattachment,  and  interaction  of  boundary  layers  numer¬ 
ical  studies  have  been  quite  limited.  Hence  there  have  been  several 
studies  [1-8]  using  experimental  methods.  With  the  advent  of  super¬ 
computers,  there  have  also  been  some  studies  to  utilize  the  power 
of  computational  fluid  dynamics  techniques  in  the  modeling  of  con¬ 
vective  heat  transfer.  The  problems  addressed  were  cooling  tower 
studies  [9,10],  right  angled  bends  [11,12],  rectangular  pin 
geometries  [13-16]. 

Among  the  numerical  techniques  used  to  solve  fluid  mechanics 
and  heat  transfer  problems  finite  difference  and  finite  element 
methods  are  most  common.  Of  these,  the  former  because  of  its  sim¬ 
plicity  has  found  wider  applicability.  However,  the  finite  element 
method  has  some  advantages  to  offer;  for  example,  the  ease  w;th 
which  complex  geometries  and  boundary  conditions  can  be  modeled, 
sound  mathematical  foundation,  the  simplicity  with  wt’ch  phase 
changes  can  be  included  etc.  Hence,  the  use  of  finite  element 


methods  in  increasing  in  fluid  mechanics. 


In  this  report,  a  summary  of  the  work  performed  in  the  study 
of  the  application  of  finite  element  method  to  fluid  mechanics 
problems  is  presented.  The  project  consisted  of  two  phases.  The 
first  phase  is  the  numerical  computation  of  the  various  geometries 
and  the  second  phase  is  the  inclusion  of  the  turbulence  model  into 
the  computer  code.  The  computer  program  used  in  this  study  is  NON¬ 
FLAP,  a  general  purpose  finite  element  program  to  analyze  fluid 
mechanics  and  heat  transfer  problems  developed  by  Murty  [17].  The 
two  phases  of  the  project  proceeded  simultaneously,  in  that  the 
evaluation  of  various  turbulence  models  took  place  in  parallel  with 
the  computer  runs  of  the  fin  geometries.  In  the  rest  of  this 
report,  a  description  of  the  governing  equations  and  formulation  is 
presented  followed  by  numerical  results  and  conclusions  and  recom¬ 
mendations  for  further  work. 
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GOVEBNING  EQUATIONS  AND  FORMULATION 


The  equations  which  govern  the  flow  of  steady,  incompressible, 
Newtonian,  two  dimensional  flows  of  Newtonian  fluids  are  the  con¬ 
servation  of  mass,  linear  momentum,  and  energy.  They  are  given  as 
f ol lows ; 
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The  Doundary  conditions  are 
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The  fluid  domain  and  the  boundary  conditions  are  shown  in  Fig.  1. 
The  first  step  in  the  solution  of  Eqs.  (1)  -  (5)  using  the  finite 
element  method  is  to  obtain  a  variational  statement  of  the  problem. 
This  is  done  by  taking  the  inner  product  of  Eqs.  (1)  -  (4)  with  p, 
u,  v^,  and  T  respectively.  Thus  we  get 
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Eqs.  (7)  -  (9)  can  be  simplified  by  using  the  Gauss 
theorem  on  the  diffusion  terms  and  written  as 
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Eqs.  (6),  and  (9)  -  (11)  are  valid  over  the  entire  fluid  domain  _0.. 
In  the  finite  element  method  the  above  equations  are  written  for  a 
small  subdomain  called  the  finite  element  (see  Fig.  2). 

Mixed  Method: 

The  dependent  variables  to  be  interpolated  in  this  case  are  u,  v, 
p,  and  T.  Over  each  element  they  are  approximated  as  follows: 


•  -r--  t  - 

p: 

If  Eq.  (13)  is  substituted  into  Eqs.  (6), 
set  of  matrix  equations  is  obtained 

(C(q)  +  K1  {q{  = 

where 
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It  should  be  pointed  out  that  since  the  governing  equations  are 
nonlinear,  Picard's  method  of  successive  substitutions  has  been 
used  to  solve  Eq.  (14).  The  starting  vectors  for  the  iteration 
process  are  creeping  flow  for  the  hydrodynamic  part  and  diffusion 
solution  for  thermal  part.  The  following  convergence  criterion  has 
been  used  for  temperature  for  iteration  number  j.  Similar  criteria 
have  been  on  the  other  dependent  variables  also. 
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The  convergence  tolerance  (=  1%  used  in  this  study)  can  be  made  as 
small  as  necessary.  However,  it  was  found  during  the  investigation 
that  smaller  values  {<  1%)  did  not  significantly  improve  the  solu¬ 
tion,  but  tremendously  increased  the  computational  effort. 

Penalty  Method; 

As  opposed  to  the  mixed  method,  here  pressure  does  not  appear 
explicitly  as  an  interpolation  variable.  Hence  this  method  results 
in  fewer  degrees  of  freedom  per  element  and  consequently  less  com¬ 
putational  effort.  In  this  method  the  continuity  equation  is 
treated  like  a  constraint  which  must  be  satisfied  by  the  velocity 
field.  By  introducing  Eq.  (13)  into  the  penalized  variational 
statement  of  Eqs.  (1)  -  (4),  we  get  the  following; 

[C(g)  -t-  K  +  K'  Hq}  =  IF}  (17) 
where  the  matrices  C,  K,  K',  and  J"  are  given  by 
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The  matrices  C  and  F  are  given  in  Eq.  (15).  The  integrands  in  Eqs. 
(15)  and  (18)  are  evaluated  numerically  due  to  two  reasons:  alge¬ 
braic  complexity,  and  irregularity  of  the  domains.  The  usual  pro¬ 
cedure  is  to  use  Gaussian  quadrature  to  evaluate  them.  Any  polyno¬ 
mial  can  be  exactly  integrated  if  a  high  enough  order  of  quadrature 
is  used.  However,  in  the  context  of  penalty  methods  for  incompres¬ 
sible  flow  problems,  a  difficulty  arises  as  will  be  explained 
below.  Eq.  (1)  which  denotes  the  conservation  of  mass  can  be 
viewed  as  a  constraint  to  be  satisfied  by  the  velocity  field.  From 
a  computational  point  of  view  such  constraints  can  be  satisfied  by 
using  either  the  method  of  Lagrange  multipliers  or  penalty  method. 
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The  former  results  in  the  mixed  method  in  which  the  pressure  can  be 
identified  with  the  Lagrange  multiplier.  In  the  penalty  method  how¬ 
ever,  the  mass  conservation  is  satisfied  by  the  penalized  varia¬ 
tional  statement  for  Eq.  (1)  as  given  by  Eq.  (17).  The  continuity 

equation  is  exactly  satisfied  as  the  penalty  parameter  - >  . 

However,  as  increases ,  the  continuity  part  dominates  and  we 
obtain  only  the  trivial  result  (no  flow  as  can  be  seen  from  Eq. 
(17)).  Tc  remedy  this,  it  is  necessary  to  impose  a  singularity  in 
F  '  ,  so  that 

[K*  1  iq}  =  0  but  {q}  5^  0  (19) 

Such  singularities  can  be  easily  introduced  by  evaluating  K'  using 
low  or  reduced  integration  orders.  (See  Zienkiewicz  [18]).  Thus 
while  the  matrices  C  and  K  are  to  be  evaluated  with  a  high  degree 
of  accuracy,  must  be  evaluated  to  a  lower  order.  In  this  st.Jy, 
3X3  integration  is  used  for  C,  K,  and  F  while  a  2  X  2  scheme  is 
used  for  .  This  has  worked  quite-  satisfactorily  in  the  present 
study.  Finally,  the  magnitude  of  penalty  parameter  should  be  as 
large  as  possible.  However,  the  word  length  of  the  computer  imposes 
a  limitation  on  this  parameter.  For  small  values  (less  than  10*  ) 
the  solution  is  effected.  For  larger  values,  it  is  found  that  con¬ 
vergence  becomes  difficult.  The  effects  of  variation  of  the  penalty 
parameter  on  the  convergence  of  the  numerical  scheme  and  the 
overall  solution  are  demonstrated.  The  convergence  criterion  is 
the  same  as  used  for  the  mixed  method  (Eq.  (16)) 
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Turbulence  Model: 


The  need  of  turbulence  modeling  in  the  analysis  of  heat 
exchangers  can  be  hardly  overemphasized.  Usually,  higher  values  of 
friction  drag  and  p-essure  drop  are  associated  with  turbulent 
flows,  which  result  in  increased  mixing  and  help  diffuse  a  quantity 
faster  in  the  flow.  Besides,  it  is  quite  well  known  in  fluid 
mechanics  literature,  that  laminar  flows  in  nature  are  more  an 
exception  than  a  rule.  Numerical  modeling  of  turbulent  flows  usu¬ 
ally  follows  the  solution  of  Navier  Stokes  equations  with  Reynolds 
stresses  included  in  them.  In  order  to  predict  turbulent  flows 
numerically,  it  becomes  necessary  to  make  closure  assumptions  about 
the  apparent  turbulent  stresses  and  heat  flux  equations.  It  must  be 
noted  however,  that  all  presently  known  turbulence  models  have  lim¬ 
itations.  The  ultimate  turbulence  model  is  yet  to  be  developed.  A 
g d  discussion  on  the  various  turbulence  models  and  the  associated 
numerical  models  can  be  found  in  references  [19-23].  In  this  inves¬ 
tigation,  the  model  adopted  is  the  two  equation  or  k-e  model  (see 
[19-20]  for  details).  In  this  model  in  addition  the  usual  govern¬ 
ing  equations  for  pressure,  velocities,  and  temperature  (in  the 
form  of  continuity,  linear  momentum  and  energy)  two  other  equations 
need  to  be  added.  They  are  the  turbulence  energy  Tc,  and  the  dissi¬ 
pation  rate  €  equations  which  are  given  as  follows: 
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The  element  analogue  to  the  above  is  obtained  in  the  usur  I 

way.  First  the  variational  statement  corresponding  to  the  above 
equations  is  obtained  for  the  whole  domain  which  is  later  special¬ 
ized  for  a  particular  element.  Then  the  above  equations  are  com¬ 
bined  with  Eqs .  (10)  -  (12)  to  obtain  a  set  of  equations.  These 
when  combined  a  suitable  set  of  boundary  conditions  completes  the 
continuum  description.  The  constants  in  the  k-e  model  are  as  fol¬ 
lows  ; 
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Similar  to  laminar  flow  formulation  wherein  the  dependent  variables 
were  interpolated  over  a  typical  element  as  in  Eq.  (13),  in  tur¬ 
bulent  flows  there  are  additional  nodal  point  variables  correspond¬ 
ing  to  the  turbulence  energy  k  and  dissipation  e.  A  typical  element 
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with  quadratic  interpolation  for  these  variables  is  shown  in  Fic. 
3.  The  rest  of  the  formulation  follows  along  the  same  lines  as  any 
other  finite  element  formulation  for  continuum  problems,  i.e.,  gen¬ 
eration  of  the  element  matrices,  assembly,  imposition  of  boundary 
conditions,  solution  of  the  matrix  equations,  and  finally  post  pro¬ 
cessing.  At  present  NONFLAP  is  being  modified  to  include  turbulence 
modeling  capability. 
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NUMERICAL  RESULTS 


In  this  section  numerical  results  are  presented.  This  investi~ 
gation  concentrated  primarily  or.  the  applications  of  NONFLAP  to  the 
numerical  solutions  of  heat  exchangers  of  particLlar  geometries. 
These  include  pin-fin,  rectangular  plate-fin,  and  diamond  shaped 
fins.  Since  many  of  the  results  from  this  research  will  be  pub¬ 
lished  elsewhere  (some  have  already  been  published;  see  [24-2il), 
only  the  highlights  of  the  research  will  be  presented  here. 

A  schematic  diagram  cf  the  in  line  and  offset  pin-fin 
geometries  is  shown  in  Fig.  4.  It  is  to  be  noted  that  the  lines 
joining  the  centres  of  the  cylinders  intersect  each  at  right  angles 

A 

for  all  values  of  P/D.  The  computational  domain  is  denoted  by  the 
shaded  area.  The  length  of  the  domain  in  both  case-  is  three  rows 
deep.  Three  values  of  P/D  have  been  used;  1.5,  1.25,  and  1.75  and 
four  different  values  of  Pr  ;  0.005,  0.7,  5,  and  100.  The  details 
of  the  finite  element  mesh  and  the  boundary  conditions  are  shown  in 
Fig.  5.  The  mesh  consists  of  480  eight  noded  elements  for  which  the 
velocities  and  temperatures  are  interpolated  guadratical ly  and  the 
pressure  is  interpolated  linearly.  The  upstream  and  downstream 
lengths  were  chosen  to  be  about  five  diameters  long.  The  gradation 
of  the  mesh  near  the  cylinders  is  shown  Fig.  5b.  A  similar  grid  has 
been  used  to  investigate  offset  geometry  also,  but  the  total  number 
of  elements  used  was  1224. 
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Ficjiire  5:  F  in  i  te  el  emen  t  mesh  with  boundary  cond  i  t  i  otu; 

a)  Entire  qrld  and  b)  Detail  near  the  cylinders 


Results  for  in  line  geometry 


ferential  variation  of  Niisselt  number  for 
e  qeomotry  for  Ro  =  120,  Pr  =  0.7,  and 


Circumferential  variation  of  Niisselt  number  for 
offset  geometry  for  Re  =  120,  Pr  =,0.7,  and 
P/D  =  1.5  -  leff  side 


Figure  lOa:  Variation  of  drag  coerficicnt  v?;  Reynolds  number  for 
in  line  geometry 


The  plots  of  isotherms#  streamlines,  and  isobars  for  in  line 
geometry  for  F/D  of  1.5  and  Re  and  Pr  of  120  and  0.7  respectively 
are  shown  in  Fig.  6.  All  the  plots  represent  isolines  of  constant 
increments.  The  details  of  the  streamlines  and  and  isotherms  near 
the  cylinders  are  shown  in  Fig.  6d  and  6e.  Similar  plots  are  shown 
in  Fig.  7  for  offset  geometry  for  Re  =  60.  The  significant  feature 
for  this  geometry  is  the  near  total  mixing  with  practically  no 
stagnant  fluid  anywhere.  Notice  the  recirculation  region  here  also 
after  the  last  cylinder.  T;.e  variation  of  local  Nusselt  numbers  for 
each  of  the  three  cylinders  for  in  line  and  offset  geometries  is 
shown  in  Fig.  8  and  9.  It  can  be  seen  that  as  opposed  to  the  in 
line  case,  the  maximum  heat  transfer  occurs  upstream  of  the 
shoulder  and  the  drop  is  also  more  gradual,  indicating  that  the 
convection  mode  is  quite  dominant.  The  maximum  and  average  values 
are  both  higher  than  the  corresponding  in  line  geometry. 

An  important  variable  which  has  an  impact  on  pumping  power  is 
the  pressure  drop  between  tube  rows.  The  variation  of  the  drag 
coefficient  with  Re  for  both  geometries  is  shown  in  Fig.  10  and 
compared  with  results  from  other  investigators.  The  agreement  is 
quite  reasonable.  One  trend  that  can  be  seen  is  the  increase  in  the 
value  of  drag  coefficient  as  P/D  decreases  especially  for  the  in 
line  configuration.  This  is  not  unexpected  since  the  closer  the 
corrugations  in  a  flow,  the  higher  the  flow  resistance  and  hence 
the  higher  the  pressure  drop. 
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The  same  problem  was  attempted  using  the  penalty  finite  ele- 

C  Ig- 

ment  model.  The  penalty  parameter  is  varied  from  10  tc  10  .  It 

/  if. 

was  found  that  for  low  values  of  the  penalty  parameter  (upto  10  ) 

it  took  sxx  iterations  for  the  method  to  converge;  however,  for  10*^ 
,  the  method  did  not  converge  in  20  iterations.  The  difference  in 
the  solutions  between  both  the  methods  was  found  to  be  less  than 
half  percent.  Variations  of  axial  velocity  and  temperature  at  dif¬ 
ferent  locations  in  the  configuration  are  shown  in  Fig.  11.  The 
temperatures  are  shown  at  three  locations  (the  shoulders  of  the 
three  cylinders)  while  the  velocity  is  shown  at  only  one  location. 
This  has  been  done  because  the  velocity  was  fully  developed  for 
this  value  of  Re  after  the  first  cylinder  itself.  The  results  plot- 

-I  C 

ted  are  for  ^  «  10  and  10*  ,  and  the  mixed  method.  It  is  clear 

that  the  results  are  identical.  As  the  penalty  parameter 

decreases,  the  incompressibility  condition  tends  to  be  violated, 

the  value  of  K'  would  be  of  the  same  magnitude  of  C  and  K  and  thus 

would  produce  erroneous  results  for  the  velocity  field.  The  lowest 

and  highest  values  of  for  which  converged  results  can  be 

C  lif. 

obtained  for  this  class  of  problems  are  10  and  10  respectively. 

Next,  rectangular  plate  fin  heat  exchangers  were  analyzed. 
The  geometries  of  in  line  and  offset  fins  are  shown  in  Fig.  12. 
Since  it  was  found  that  both  penalty  and  mixed  methods  gave  identi¬ 
cal  results,  this  problem  was  solved  using  the  penalty  method  only. 
The  grids  for  in  line  and  offset  geometries  are  shown  in  Fig.  13. 
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□  PENALTY 
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Fiqure  11:  Conpar ifjon  of  results  for  penalty  and  iiP  xod  tneLhods:  variat  ion 
of  temperature  at  .tlie  slioulders  of  cylinders 


The  results  in  the  form  of  streamlines  and  isotherms  are  shown  in 
Fig.  14.  Notice  the  boundary  layer  separation  and  the  wide  recircu¬ 
lating  region  even  for  low  values  of  Re.  Similar  results  were 
obtained  for  offset  fin  geometry  also. 

In  order  to  investigate  the  capabilities  of  the  computer  code 
NONFLAP  another  problem  involving  a  different  geometry  was  studied. 
This  problem  uses  fins  shaped  in  the  form  a  diamond  placed  in  an 
offset  array.  The  physical  problem  is  shown  in  Fig.  15.  Of  interest 
here  are  the  effects  of  the  included  angle  and  aspect  ratio  s/a  on 
the  heat  transfer  characteristics.  This  problem  was  solved  for  air 
with  Pr  *  0.7  and  various  values  of  Re.  Typical  streamlines  and 
isotherms  are  shown  in  Fig.  16.  Variation  of  the  average  Nusselt 
number  with  Re  is  shown  in  Fig.  17.  The  last  case  studied  involves 
the  heat  transfer  characteristics  of  a  fluid  through  a  1 80  degree 
channel.  Typical  results  for  this  case  for  different  values  of 
Peclet  number  are  shown  in  Fig.  18.  It  was  found  that  recirculation 
eddies  formed  not  only  on  the  inner  wall  on  the  exit  side-  but  also 
on  the  outer  wall  on  the  inlet  side. 
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Nunerical  results  for  in  line  plate  fin 

a,  b,  c,  d  -  streamlines 

e,  f,  g,  h  -  isotherms  for  different 

values  of  t/L  ratios 


Figure  Ic:  Diamond  shape  fin  problem:  a)  fin-te  element  grid 

'  b)  -  i)  results  for  s/a  =  1.5,  and  r  =  90  dec. 

b)  — -  e)  streamlines  and  f)  i)  isotherms 

for  Re  =  10,  20,  50  and  100  respectively 
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CONCLUSIONS  AND  RECOMMENDATIONS 


The  finite  element  method  has  been  used  to  analyze  active 
cooling  systems.  The  procedure  involved  the  solution  of  incompres¬ 
sible  and  steady  Navier  Stokes  equations  for  laminar  Newtonian 
fluid  flow.  Two  formulations  have  been  given  herein.  The  first  for¬ 
mulation  is  the  mixed  or  Lagrangian  formulation  in  which  pressure 
appears  explicitly  as  a  nodal  point  variable.  The  second  formula¬ 
tion  is  the  penalty  method  in  which  the  continuity  equation  is 
treated  as  a  constraint  on  the  velocity  field.  In  this  formulation 
pressure  does  not  appear  as  an  interpolation  variable.  Hence  the 
total  degrees  of  freedom  in  this  case  is  less.  During  the  course  of 
this  investigation,  it  was  found  that  the  highest  value  of  penalty 

(if. 

parameter  for  which  converged  values  were  obtained  was  10  .  When 

6 

the  value  was  10  ,  the  solution  was  affected.  Hence,  for  the  solu¬ 
tion  of  Navier  Stokes  equations  using  the  finite  element  method, 

S  to 

the  penalty  method  with  a  penalty  parameter  of  10  to  10  is 
recommended.  The  application  of  the  same  method  to  turbulent  flows 
is  quite  straight  forward.  However,  care  must  be  taken  to  pick  a 
suitable  turbulence  model  since  no  single  model  exists  which  can 
accurately  and  satisfactorily  describe  turbulent  flows.  In  this 
study  the  model  which  has  the  widest  application  in  two  dimensional 
flows,  namely  the  k-€  model  has  been  used. 

Several  heat  exchanger  configurations  have  been  investigated. 
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It  should  be  pointed  out  that  among  all  the  geometries,  the  diamond 
shape  is  the  easiest  to  manufacture.  Although  all  of  the  geometries 
have  been  studied,  a  comparison  of  the  heat  transfer  characteris¬ 
tics  remains  to  be  performed  so  that  selection  among  them  could  be 
simplified.  From  preliminary  studies,  it  can  be  concludtd  with  con¬ 
fidence  that  offset  geometries  give  better  heat  transfer  perfor¬ 
mance. 

There  are  several  areas  into  which  this  study  can  be  extended. 
The  most  immediate  and  natural  extension  to  the  present  investiga¬ 
tion  is  the  inclusion  of  natural  convection  into  the  model.  This  is 
quite  important  since  at  low  values  of  inlet  flows,  natural  convec¬ 
tion  can  contribute  significantly  to  the  heat  transfer.  The  inclu¬ 
sion  of  this  into  the  finite  element  model  is  very  straight  for¬ 
ward.  When  this  is  accomplished,  the  various  geometries  can  be  stu¬ 
died  for  several  values  of  Rayleigh  numbers,  spacing  and  size  of 
fins,  Prandtl  numbers,  Reynolds  numbers,  etc.  The  results  presented 
in  this  study  have  a  direct  bearing  on  cooling  of  electronic  com¬ 
ponents.  Due  to  the  small  size  of  the  passages,  the  flow  is  often 
laminar . 

The  next  area  into  which  this  research  must  be  extended  is  the 
evaluation  of  the  various  turbulence  models  available  in  the 
literature.  While  the  number  is  large,  each  model  has  limited 
application.  A  detailed  study  of  these  models  will  yield  valuable 
information  as  to  the  uses  and  limitations  of  computational  fluid 
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dynamics  (involving  turbulent  flows)  in  active  cooling  devices. 
Because  of  the  versatility  of  the  finite  element  method,  fluid  flow 
and  heat  transfer  through  porous  media  with  and  without  phase 
change  can  be  easily  undertaken.  This  will  facilitate  the  study 
se  eral  areas  of  practical  interest  to  aircraft  heat  exchangers 
like  heat  pipes,  transpiration  cooling,  leading  edge  cooling,  etc. 
If  the  Navier  Stokes  equations  are  combined  with  equations  govern¬ 
ing  changes  of  concentration  (i.e.  mass  transfer),  boiloff  and 
rollover  problems  in  fuel  tanks  can  be  studied.  The  inclusion  of 
compressibility  effects  and  three  dimensional  flows  is  also  very 
important  in  applying  the  power  of  computational  methods  to  solve 
practical  problems. 
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1 .  Abstract 


Numerical  solution  of  two-dimensional  Navier-Stokes  equations  are  presented 
for  flow  past  a  flat  plate  for  Mach  numbers  equal  to  7  and  19.2  and  Reynolds 

3 

numbers  in  the  range  10  to  10  .  The  explicit  time -dependent  MacCormack 
finite -difference  numerical  scheme  is  used  to  integrate  the  equations  under 
appropriate  velocity-slip  and  temperature -jump  conditions  valid  for  the 
h3^ersonic  slip- flow  regime.  It  is  found  that  the  skin- friction  coefficients 
and  Nusselt  ntimber  results  agree  reasonably  well  with  other  published 
numerical  and  experimental  results  for  Reynolds  number  equal  to  and  or  greater 
than  50,  but  for  less  than  50,  there  is  considerable  disagreement.  The 
radiation  heat  transfer  effects  for  thin  and  gray  gas  approximations  are  also 
discussed. 
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Subscripts 


w 


2 ■  Nomenclature 

Speed  of  sound 
Skin  friction  coefficient 
Specific  heat  at  constant  volume 
Specific  internal  energy 
Vector  fluxes 

Molecular  heat  conductivity 

Mach  number 

Static  pressure 

Heat  transfer  rate 

Reynolds  number 

time 

Static  temperature 

Velocity  components  in  cartesian  coordinates 
Vector  flux 
Cartesian  coordinates 

Transformed  coordinate  system 
Density 

Coefficient  of  viscosity 
Optical  thickness 
Viscous  stress  components 
Absorption  coefficient 
Emissivity  coefficient 

Property  evaluated  at  free  stream  conditions 
Property  evaluated  at  the  plate  surface 
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3  ■  Introduction 

The  hypersonic  flow  problems  have  recently  attracted  much  attention 
because  of  the  design  of  trans- atmospheric  vehicles  (TAV) ,  the  hypersonic 
boost  glide  vehicles  (HBGV)  and  the  national  aerospace  plane  (NASP) .  These 
have  been  published  in  five  major  text  books  [1-5]  and  many  survey  articles 
[6-9]  on  the  hypersonic  flows  describe  the  characteristics  in  detail. 

The  problem  of  rarefield  hypersonic  flow  past  a  flat  plate  has  been 
studied  by  many  research  workers  for  more  than  two  decades.  Extensive  reviews 
on  these  studies  have  been  given  by  Potter  [10],  Moulic  [11],  Anderson  [6]  and 
Chow  [12].  Results  for  the  free  molecular  flow  existing  at  the  very  tip  of 
the  plate  were  presented  by  Schaaf  and  Chambre  [13] ,  Liu  [14]  and  Charwat 
[15].  The  Monte  Carlo  simulation  for  the  free  molecule  flat  plate  flow  was 
carried  out  by  Bird  [16].  He  indicated  that  up  to  the  smallest  Knudsen  number 
which  he  investigated,  the  over-all  flow  did  not  show  any  resemblance  to  the 
corresponding  continuum  state.  The  effect  of  velocity- slip  and 
temperature -jump  on  the  wall  has  been  studied  by  Oguchi  [17]  and  Pan  and 
Probstein  [18].  Several  authors  [19-24]  presented  their  experimental  findings 
for  hypersonic  flat  plate  flow,  but  the  experimental  results  produced  from 
hypersonic  low-density  wind-tunnel  tests  on  a  flat  plate  do  not  agree  with 
predictions  from  existing  theories  within  the  slip  flow  regime.  A  flow  model 
based  on  the  integral  approach  was  developed  by  Chow  [12]  to  study  hypersonic 
low-density  flow  past  a  sharp  leading  edge  of  a  flat  plate.  The  disturbed 
flow  field  is  divided  into  two  subregions,  namely,  the  continuous 
viscous -conducting  region  (shock -wave -like  region)  and  the  merged  layer  region 
(boundary  layer  like) .  The  Navier-Stokes  equation  is  assumed  to  be  applicable 
to  the  wave  region  and  for  the  merged  layer  region  a  cubic-parabolic  slip 
velocity  profile  is  employed  and  the  Crocco  integral  relationship  is  adopted 
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to  correlate  the  velocity  and  temperature  profiles.  Davis  and  Harvey  [25] 
presented  the  Monte-Carlo  direct  simulation  method  for  diatomic  molecules  and 
compared  their  results  with  experimental  results  for  a  zero- incidence 
flat-plate  in  Mach  21  low  Reynolds  number  flow.  The  skin  friction  and  drag 
measurements  on  flat  plates  were  presented  by  Boettcher,  et  all  [26],  in  the 
range  between  hypersonic  continuum  and  free  molecular  flow.  In  a  recent  paper 
[27],  Gokcen,  MacCormack  and  Chapman  carried  out  the  extensive  numerical 
calculation  for  hypersonic  flow  over  a  flat  plate  at  Reynolds  numbers  ranging 
from  continuum  to  the  free  molecule  flow  regime.  They  introduced  new  slip 
boundary  conditions  in  the  transition  flow  regime  and  solved  the  two 
dimensional  Navier-Stokes  equations.  Comparison  of  the  computed  results  with 
the  existing  experimental  data  and  Monte-Carlo  calculations  indicate  that  the 
continuum  Navier-Stokes  equations  give  surprisingly  realistic  results 
throughout  the  transition  regime,  when  the  new  boundary  conditions  are  used. 
Results  have  been  obtained  by  Hermina  [28]  for  the  diverging  flow  of  helium 
gas  along  a  flat  plate  using  a  direct  simulation  Monte -Carlo  technique.  Skin 
friction  results  are  presented  for  a  variety  of  transverse  mesh  resolutions 
corresponding  to  values  of  the  Knudsen  numbers,  Ku  ranging  from  2.452  to 
0.1226. 

In  this  report,  the  main  objective  has  been  to  obtain  the  results 
published  in  [27],  by  solving  the  two-dimensional,  unsteady,  compressible 
Navier-Stokes  equations  numerically  under  appropriate  velocity-slip  and 
temperature- jump  conditions  for  flow  past  a  flat-plate.  Calculation  were 
performed  for  Mach  numbers  equal  to  7  and  19.2  and  the  Reynolds  number  varying 

3 

from  10  to  10  .  The  effect  of  varying  the  grid  size  on  the  skin  friction 
coefficients  and  the  Nusselt  number  are  presented  and  compared  with  the 
existing  theoretical  and  experimental  results.  The  radiation  heat  transfer 
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contribution  is  also  considered  in  this  paper. 

athematical  Formulation  and  Numerical  Procedure 


The  equations  governing  the  hypersonic  flow  in  the  present  analysis  are  the 
time -dependent,  compressible,  two-dimensional  Navier-Stokes  equations.  A 
general  coordinate  transformation. 


C  -  l(x)  ,  q  -  »?(x,y) 


is  introduced  to  take  into  account  the  variable  grid  in  the  x-y  plane.  The 
governing  equations  can  then  be  written  in  the  so  called  conservative  form 
[29,30] , 


au  ^  .  aF  ^  3G  ^  3G  - 
at  a^  'x  dr)  y  dr) 


where  the  vector  fluxes  U,F  and  G  are  given  by. 
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The  components  of  the  shear-stress  tensor  and  heat-flux  are  given  by, 
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Derivation  of  the  components  of  the  radiation  flux  vector  and  are 
discussed  in  the  next  section.  The  equations  of  state  given  by  the  perfect 
gas  law,  Sutherland's  viscosity  law  and  assigned  Prandtl  number  (0.73) 
formally  close  the  system  of  equations.  The  equations  (2)  are  numerically 
integrated  in  the  region  0  <  x  <  1  along  the  flat  plate  and  0  <  y  <1  at  right 
angles  to  the  plate.  The  boundary  conditions  for  the  hypersonic  flow  in  the 
slip  flow  regime  have  been  discussed  by  many  authors  [  13,27  ].  On  the  plate 
(y— 0) ,  the  approximate  Maxwell  velocity-slip  and  temperature- jump  conditions 
are  given  by 

I 

i 
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(7) 


u(0)  -  7  (  |H 


T(0) 


2  -  g  27  i_  ,  ar 
w  a  7  +  1  Pr  ''  ay  ''O 


(8) 


where  a  is  the  fraction  of  diffusely  reflected  molecules  and  a,  the 
accommodation  coefficient  [  13,27  ].  Both  a  and  a  in  the  present  calculations 
are  assumed  to  be  unity.  The  upstream  and  initial  conditions  are  prescribed 
by  the  values  of  the  unperturbed  freestream. 

Initial  condition: 


U(  0  ,  ?  ,  n  )  -  (9) 

Upstream  condition: 

U(t,C,»?)-U^.  ?<0  (10) 

The  gradients  of  all  the  variables  U,  F,  and  G  are  assumed  to  be  zero  at  x-=l , 

0  <  y  <  1  and  0  <  x  <  1,  y-1,  valid  for  the  far- field  boundary. 

The  whole  flow  field  (0  <  x  <  1,  0  <  y  <  1)  is  divided  into  a  M  x  N  mesh 
such  that  the  mesh  is  stretched  at  right  angles  to  the  plate  exponentially 
near  the  plate  to  provide  adequate  resolution  within  the  viscous  boundary 
layer,  and  into  uniform  mesh  size  away  from  the  plate.  The  region  along  the 
plate  is  divided  into  uniform  mesh  size.  Figure  1  shows  a  typical  64  x  64 
mesh  used  in  the  present  study. 

The  explicit  MacCormack  method  [29,30]  involving  a  two-step  predictor  and 
corrector  finite-difference  scheme  is  used  to  integrate  the  system  of 
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equations  given  by  (2) .  The  terms  in  the  flux  vectors  F  and  G  are 
approximated  by  a  central,  forward  and  backward  differencing  scheme  in  such  a 
fashion  that  the  numerical  scheme  is  second-order  accurate  both  in  space  and 
time.  The  time  step  At  is  taken  to  be. 


At  < 


(1  +  _2_  ) 


(11) 


where  a  is  the  safety  factor  (=«  0.9)  and  (At)^pj^  is  the  inviscid  CFL  condition 
[30], 


(At) 


CFL  " 


(12) 


Re^  is  the  minimum  mesh  Reynolds  number. 


Re.  -  min  (Re, 

A  Ax 


Re 


Ay 


(13) 


where, 


<7  u  Ax  „  p  V  Ay  ,  T ,  ^ 

Re,  -  — -  ,  Re,  -  ^  (14) 

Ax  /i  Ay  p 

and. 

With  a  given  arbitrary  initial  guess  the  solution  of  the  time -dependent 
equations  are  obtained  by  an  iterative  process  approaching  an  asymptotic 
steady-state. 
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5.  Radiation  Heat  Transfer 

In  this  section,  we  will  discuss  a  methedology  to  solve  for  the  radiation 
transfer  contribution  on  the  energy  equation.  Consider  the  fluid  flowing  over 
a  flat  plate,  and  the  medium  being  absorbing,  emitting  and  semi- inf inite .  The 
radiative  transfer  equation  (RTE)  and  the  corresponding  boundary  condition 
(b.c.)  are  given  for  this  problem  as, 

M  T(r,e)l  (15) 


with, 

1 

1(0,^)  -  ;  1(0, dM 

0 

1 

-  [  T^(0]  +  2  (1  -  ;  I(0,e,.M')  d/i'  (16) 

where  I  is  the  radiation  intensity, 

y  X 

T  -  f  ^  dy'  and  ^  -  J  ^  dx' 

0  0 

are  the  optical  thicknesses  in  the  y-and  x-directions ,  respectively,  /?  is  the 
volumetric  extinction  (absorption)  coefficient  (in  m  ^) ,  T(t,  |)  is  the  medium 
temperature  distribution,  T^(C)  Is  the  wall  temperature  distribution,  and 
is  the  wall  emissivity.  Here,  we  will  consider  only  one  dimensional  radiative 
transfer  primarily  in  the  y-direction  (perpendicular  to  the  plate) .  The 
^-dependence  is  because  of  the  ^-dependence  of  the  temperature  distribution 
T(r,^).  It  is  worth  to  note  that  a  two-dimensional  formulation  of  the  RTE  is 
much  more  complicated,  and  we  will  not  treat  it  here. 
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The  radlaclon  term  which  appears  In  the  energy  equation  is  the  divergence 


of  the  radiative  heat  flux,  which  can  be  obtained  after  solving  equation(15) 
for  the  radiation  intensity.  For  a  gray  azimuthally  symmetric,  planer  medium, 
the  radiative  flux  is  give  as [31], 

q^(r,^)  -  2w  ^  d/i  (17) 

Then,  we  can  write  the  divergence  of  the  radiative  flux  distribution  in 
dimensionless  form  as. 


aQ^(r.g)  _ 
3r 


Ar.O  ■  I 


-d  • 


^0 


E2(r')  dr' 


E^(lr-r'()  dr' 
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where , 
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The  boundary  condition  is  given  as. 


Q’'(r-0.^) 


€ 

W 


4 


1) 


-  2  f  {«^(r' ,^)  -  1  )  E,(r')  dr'  ]  (20) 

0  ^ 

These  equations  are  written  for  a  uniform  medium  refractive  index,  n. 

After  the  energy  equation  is  solved  without  including  the  radiation 
effect,  the  temperature  distribution  is  used  in  equation  (4)  to  obtain  the 
aQ  /Sr  variation.  This  term  is  used  as  a  source  term,  and  Equation(18)  is 
solved  again.  This  iterative  procedure  is  followed  until  a  preset  convergence 
criteria  is  satisfied. 

Optically  Thin- Fluid  Approximation 

Although  many  simplified  asscunptions  were  introduced  in  obtaining  the 
RTE,  such  as  planer,  one -dimensional  system,  gray  non- scattering  medium 
radiatively,  homogeneous  fluid  layer  and  uniform-property  plate  surface, 
equation(18)  is  still  complicated.  We  can  simplify  it  further  by  considering 
the  physical  problem.  In  general,  the  model  developed  in  this  study  is 
applicable  to  flight-conditions  at  very  high  elevations,  where  gas-density  is 
significantly  low.  It  is,  therefore  possible  to  assume  that  the  volumetric 
extinction  (absorption)  coefficient  is  small,  which  yields  r, 

y(L) 

r  -  f  ^  dy  «  1  (21) 

0 

where  y(L)  is  the  upper- limit  of  the  domain  we  are  interested  in.  Following 
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this  we  note  that, 


E2(r) 


1  -  0(t) ,  e  ^  -  1  -  T  + 


(r^) 


(22) 


where,  0  denotes  the  order  of  the  neglected  terms.  Using  these  expressions, 
one  can  show  that  for  an  optically  thin  gas,  equation  (18)  is  rewritten  as. 


aQ^(r,^) 

dr 


[  d^ir.O  -  1]  -  I  1  ] 


(23) 


with  the  boundary  condition. 


Q’'(r-0,^)  - 


[{  C(l)  -  1) 
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-  2^[d 
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(24) 


In  order  to  take  into  account  the  spectral  nature  of  the  radiation,  we 
may  use  the  Planck  mean  absorption  coefficient  in  the  definition  of  optical 
thickness : 


r 


r 

P 


(25) 


Where  is  the  temperature  of  the  fluid  layer,  and  is  the  spectral 
absorption  coefficient  of  the  fluid. 


Optically  Thick- Fluid  Aooroximation 

We  can  simplify  the  RTE  considerably  if  we  assvime  that  the  gas  (or  fluid)  is 
optically- thick.  Although  it  may  not  be  applicable  to  the  practical  system  of 
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interest  of  this  study,  it  is  important  to  analyze  the  limit,  because  all 
practically  possible  situations  will  fall  between  the  optically  thin  and  thick 
approximation  results. 


For  this  case,  we  can  write  the  dimensionless  radiative  flux  as, 

i  X  (26) 

The  divergence  of  the  radiative  flux,  3Q  (r,^)/ar  is  obtained  from  Equation 
(10)  readily.  It  is  important  to  note  that  for  this  case  the  optical 
thickness,  is  based  on  the  Rosseland  mean  absorption  coefficient  as. 


r  —  r 


R  -  V  -  y  <  J"  r 

A— 0  u 


dA  ] 


-1 


(27) 


Solution  of  the  RTE  for  intermediate  optical  thickness 

If  the  optical  thickness  of  the  medium  is  neither  small  nor  large,  then  the 
radiation  contribution  is  obtained  by  solving  the  full  RTE.  The  solution  of 
this  integro- differential  equation  is  not  easy,  and  some  approximation  would 
be  desirable.  For  this  reason,  the  double  spherical  harmonics  (DP) 
approximation  will  be  applied  to  the  RTE.  This  approximation  has  been 
developed  recently  by  Menguc  and  Iyer  [32]  and  has  shown  to  be  very  reliable 
for  optical  thicknes  less  than  10,  and  can  account  for  an  anisotropically 
scattering  medium. 


6 .  Discussion  of  results 

The  numerical  calculations  have  been  performed  on  the  IBM  3090  (super) 
computer,  a  facility  available  at  the  University  of  Kentucky.  To  gain 
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experience  in  numerical  calculations  based  on  the  complete  Navier* Stokes 

equations  and  to  test  whether  the  numerical  scheme  is  adequate,  it  was  first 

attempted  to  reproduce  the  results  obtained  in  [27],  In  table  1,  results  for 

average  skin  friction  coefficients  calculations  are  given  for  M-7  and  R^-100 , 

3  4 

10  and  10  for  flow  past  a  flat  plate  under  the  no-slip  and  adiabatie  wall 
conditions.  Number  of  grid  points  and  the  spacing  for  the  first  grid  in 
y-direction  near  the  plate  are  also  given  in  the  table  The  results  are 
compared  in  Figure  (2)  with  those  given  in  [27],  and  it  is  found  that  the  skin 
friction  results  based  on  several  variations  of  number  of  grids  and  spacing 
are  a  few  percent  less  than  obtained  in  [27]. 

The  local  and  average  skin  friction  coefficients  are  defined  as. 
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Table  1:  Numerical  Flow-Field  Parameters  for  M-7 


grid  size 

64  X  48 

64  X  48 

64  X  48 

64  X  48 

96  X  96 

S  B- layer 

0.5 

0.5 

0.75 

0.75 

0.75 

^^min 

0.002 

0.001 

0.001 

0.0005 

0.0005 

Re 

00 

100 

0.4084 

0.4295 

0.3776 

0.4254 

0.4033 

1000 

0.0794 

0.0832 

0.0786 

0.0848 

0.0804 

10,000 

0.0171 

0.0191 

0.0178 

0.0188 

0.0175 
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For  hypersonic  flow  (i.e.  at  large  Mach  numbers),  the  exchange  of  heat 
between  surface  and  gas  consists  of  two  components,  namely: 

(1)  Thermal  conduction, 

(2)  work  done  by  shear  forces  due  to  the  existence  of  a  finite  slip  veloci  ty 
at  the  surface.  Then  the  total  heat  transfer  is  given  by  (+ve  into  the  body). 


r  1, 
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^y-0 


(30) 
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(31) 


The  results  for  the  velocity  slip  flow  conditions  have  been  obtained 
corresponding  to  two  wall  temperature  conditions: 

(a)  -  0  on  the  wall 

Sy 

(b)  temperature  jump  condition  on  the  wall,  T ^T^^-0.7. 

Numerical  experiments  were  run  on  different  grid  sizes  with  varying  grid 
spacing  normal  to  and  parallel  to  the  streamwise  direction, 

minimum  grid  spacing  at  the  surface  of  the  plate,  normal  to  the  flow  direction 

was  varied  from  0.05  to  0.0005.  A  value  of  AY  .  -  0.001  was  selected,  as 

min 

lower  values  did  not  have  any  significant  effect  on  the  skin  friction 
coefficient.  Three  grids  of  size  64  x  64  and  extending  1.4,  3.0  and  6  inches 
normal  to  the  plate  of  length  1.  inch  were  studied.  These  are  illustrated  in 
Figures(2a,b,c) ,  respectively.  The  grid  is  constructed  so  as  to  have  an 
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exponentially  increasing  grid  spacing  normal  to  the  wall,  and  from  the  leading 
edge  going  downstream.  This  ensures  a  good  definition  of  the  flow  field  in 
the  immediate  vicinity  of  the  wall  and  in  the  stagnation  region. 

For  M-7 ,  T velocity-slip,  experiments  were  run  for  Reynolds 
numbers-10,  50,  100,  and  1000.  Figure(3a,b)  show  the  variation  of  the  local 
skin  friction  coefficient  and  the  local  Stanton  number.  Two  grids,  given  in 
Figures ( 2a, b)  were  tested  in  these  cases,  both  giving  the  same  results.  As 
would  be  expected  with  the  existence  of  a  slip  velocity,  the  heat  flux  at  the 
wall  should  follow  the  same  trend  as  the  skin  friction  coefficient.  This  is 
because  of  the  term  occurring  in  equation(30) ,  which  contributes  to  the 

mechanical  heat  generation  at  the  wall.  The  profiles  given  in  figures(3a,b) 
indicate  an  increasing  trend  with  'x' ,  and  after  passing  a  maximum  begin 
decreasing  again  as  the  boundary  layer  develops  in  the  downstream  direction 
along  the  plate.  This  trend  can  be  explained  by  considering  the  mach  nianber 
contours  shown  in  Figures (4a,b, c) .  It  is  noted  that  in  the  leading  edge 
region  the  isomach  lines  have  a  higher  density  in  a  particular  region  of  the 
plate,  and  away  from  this  region,  the  lines  are  more  spaced  out.  Such  dense 
regions  contribute  to  larger  gradients  in  the  properties  of  the  flow  field, 
leading  to  higher  local  skin  friction  and  Stanton  number  values .  Close 
observation  of  figures(4a,b,c) ,  reveals  that  these  dense  regions  compare  in 
location  with  the  occurrence  of  the  maxima  in  figures(3a,b) .  For  Reynolds 
number-10,  the  results  proved  unsatisfactory,  and  hence  are  not  presented. 
Table  2  gives  the  average  value  of  the  skin  friction  coefficient  on  the  plate. 
As  the  Reynolds  nxunber  increases  the  skin  friction  or  Stanton  number  value  at 
any  point  decreases . 
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Table  2 .  M-7 :  T^/T^^-0.7:  Velocttv-slio 

Re  number  10  50  100 

0.552  0.195  0.159 

av 

St  0.74  0.16  0.14 

av 

For  M-19.2,  T ^T^^-0.7,  and  velocity-slip,  experiments  were  run  for 
Reynolds  numbers-10,  50,  and  100.  Figure ( 5a, b, 6) , show  the  variation  of  the 
local  skin  friction  coefficient  and  the  local  Stanton  number.  Two  grids, 
given  in  Figures (2a, c)  were  tested  in  these  cases,  both  giving  the  same 
results.  The  trends  followed  are  similar  to  those  presented  for  M-7.  The 
skin  friction  and  stanton  values  have  Increased  almost  proportionally  with  the 
increase  in  mach  number.  For  Reynolds  number— 10,  the  results  proved 
unsatisfactory,  and  hence  are  not  presented.  Table  3  gives  the  average  value 
of  the  skin  friction  coefficient  on  the  plate.  As  the  Reynolds  nvimber 
increases  the  skin  friction  and  Stanton  number  values  decrease  at  any 
location,  however,  downstream  of  the  maxima,  the  two  coalesce  to  reach  an 
asymptotic  value. 


Table  3.  M-19.2:  T^^ /T^^-0 . 7  :  Velocity- slip 

Re  number  10  50  100 

0.849  0.276  0.212 

av 

St  0.75  0.19  0.13 

av 


Numerical  experiments  were  also  performed  with  6T/6y  -  0 .  Even  though 
these  computations  did  stabilise,  the  resulting  flow  fields  that  ensued  were 
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highly  disorderd  and  are  currently  the  subject  of  study.  Further  studies  are 
underway  to  incorporate  the  radiation  transfer  equations  into  the  algorithm, 
and  to  follow  with  additional  numerical  experiments  and  flow  field  study. 


7 .  Conclusion 

The  explicit  MacCormack  numerical  scheme  based  on  the  two-dimensional  unsteady 
compressible  Navier-Stokes  equations  gives  reasonably  comparable  results  for 
the  skin  friction  and  heat  transfer  coefficients  for  Reynolds  number  equal  to 
and  greater  than  50.  More  time  and  effort  are  needed  to  make  conclusive 
statements  whether  the  Navier-Stokes  equations  are  adequate  to  describe  very 
low  density  regimes. 

ACKNOWLEDGEMENTS 

The  author  would  like  to  acknowledge  the  financial  support  given  by  the 
Air  Force  Office  of  Scientific  Research  and  the  Universal  Energy  Systems, 
Dayton,  Ohio.  He  would  like  to  thank  Dr.  Joe  Shang  for  providing  him  the 
numerical  scheme  package,  for  continued  suggestions  and  discussions,  without 
which  the  numerical  calculations  could  not  have  been  possible. 


8 .  References 

1.  Truitt,  R.W. ,  Hypersonic  Aerodynamics .  Ronald  Press,  New  York,  1959. 

2.  Hayes,  W.D. ,  and  Probstein,  R.F.,  Hypersonic  Flow  Theory.  Academic  Press, 
New  York,  1959. 

3.  Chernyi,  G.G. ,  Introduction  to  Hypersonic  Flow  (translated  from  Russion 
by  R.F.  Probstein),  Academic  Press,  New  York,  1961. 

4.  Dorrance,  W.H.,  Viscous  Hypersonic  Flow.  McGraw-Hill  Book  Co.,  New  York, 
1962. 

5.  Cox,  R.N.  and  Crabtree,  L.F.,  Elements  of  Hyperson.^c  Aerodynamics. 
Academic  Press,  New  York,  1965. 

6  Anderson,  J.D.,  AIAA,  paper  No.  85-1578,  Jan.  1984. 

7.  Nagamatsu,  H.T.  and  Sheer,  R.E.,  Jr.,  AIAA  paper  No.  85-0999,  June  1985. 

8.  Holden,  M.S.,  AIAA  paper  No.  86-0267,  Jan.  1986, 

9.  Tirmalesa,  D. ,  AIAA,  J.,  1968,  Vol.  6,  369-370. 

10.  Potter,  J.I,.,  Rarefied  Gas  Dynamics.  Academic  Press  Inc.,  New  York,  1967, 
suppl.  4,  Vol.  II,  pp  881-937. 

11.  Moulie,  E.S.,  Jr.,  Space  Sciences  Lab.  Uniy.  of  Calif.  Berkeley,  TR 
Series  7,  Jan.  1968. 

12.  Chow,  W.L.,  AIAA,  J.,  5,  1967,  1549-1557. 

13.  Schaaf,  S.A.  and  Chambre,  P.L.,  Flow  of  Rarefied  gases.  Vol.  Ill,  Sec. 

H. ,  1958,  pp  687-793. 


58-20 


14.  Liu,  V.C.,  J.  Fluid  Mech.  5,  1968,  481-490. 

15.  Charwat,  A.F.,  Rarefied  Gas  Dynamics .  Academic  Press  Inc.,  New  York  1961, 
suppl.  1,  pp,.  553-578. 

16.  Bird,  G.A. ,  AIAA  J . ,  4,  1966,  55-67. 

17.  Oguchi,  H. ,  Rarefied  Gas  Dynamics.  Academic  Press  Inc.,  New  York,  1963, 
suppl,  2,  Vol,  II,  pp.  181-193. 

18.  Pan,  Y.S.  and  Probstein,  R.F.,  Fundamental  Phenomena  in  Hypersonic  Flow. 
Cornell  Univ.  Press,  Ithaca,  N.Y. ,  1966,  Rept.  195  (1952) 

19.  Ghuan,  L.  and  Waiter,  S.A.,  Rarefied  Gas  Dynamics .  Academic  Press  Inc., 
N.Y.,  1963,  Suppl.  2,  Vol.  II,  pp.  328-342. 

20.  Vidal,  R.J.  and  Wittliff,  C.E.,  Rarefied  Gas  Dynamics.  Vol,  II,  1963. 

21.  McCrosky,  W.J.,  Bogdonoff,  S.M.  and  McDougall,  J.,  AIAA  J.  4,  1966, 
1580-1588. 

22.  Schaaf,  S.A.,  Hurlbut,  F.C.,  Talbot,  L.  and  Aroesty,  J.,  ARS,  J.,  29, 
1959- ,  527. 

23.  Nagamatsu,  H.T.  and  Li,  T.Y. ,  J.  Aero.  Sci,  20,  1953,  345-355. 

24.  Nagamatsu,  H.T.  and  Sheer,  R.E.  Jr.,  Hypersonic  Gas  Dynamics .  AIAA,  20th 
Thermophysics  Conf . ,  June  19-21,  1985,  VA. 

25.  Davis,  J.  and  Harvey,  J.K.,  Rarefied  Gas  Dynamics.  Vol. 51,  1976,  335-48. 

26.  Beottcher,  R.D.,  Koppenwallner ,  G.  and  legge,  H. ,  Rarefied  Gas  Dynamics. 
Vol,  51,  1976,  349-60. 

27.  Gokcen  T. ,  MacCormack,  R.W.  and  Chapman,  D.R. ,  Computational  Fluid 
Dynamics  Near  the  Continuum  Limit.  1987. 


63-21 


28.  Hermina,  W.L.,  J .  Thermophvs i cs .  Vol.  3,  pp.  7-12,  1989. 

29.  Shang.  J.S.  and  Hankey,  W.L.,  AIAA  J . ,  Vol.  15,  pp.  1575-1582,  1977. 

30.  Anderson,  D.A. ,  Tannehill,  J.C.  and  Fletcher,  R.H. ,  Computational  Fluid 
Mechanics  and  Heat  Transfer.  Hemisphere  Publ.  Corp. ,  NY,  pp.  479-489,  1984. 

31.  Ozisik,  M.N. ,  Radiative  Transfer  and  Interaction  with  Conduction  and 
Convection.  Wiley,  New  York,  NY,  pp.  314-316  and  485-521,  1973. 

32.  Menguc,  M.P.  and  Iyer,  R.K. ,  J.  Quant.  Spectrosc.  Radiat.  Transfer.  Vol. 
39,  pp.  445-461,  1988. 


68-22 


..Aft***' 


A/<»  =  7 


Present  Study 

=  0  No-slip 
0.6  —  0.8  Experiment'* 

0.7  -  Monte- Carlo'* . .  — 

^  =  0  Generalized  slip 
^  =  0  Maxwell  slip 

0,7  Generalized  slip  -r  T,  —  T,; 


Skin  friction  drag  vs  Reynolds  number. 


Figure (1) 


U)eal  Sldn  Friction  Coeffleient 

M*7..  velocity-slip 


.  flviSa  X  It  =0.7 

w  aw 


a  (I.-IOO 

^  n«.iooo 


Local  Stanton  Summer 

(11=7.,  velocity-slip 


ft  ••SO 
O  RxlOO 
^  flolODO 


T  /T  =0-7 
w  aw 


oooooooooooocooo 


OOOOaOOOOOOOQOOO 

'****♦*«*♦***♦♦**♦♦♦*♦*♦♦**♦*♦ 


Qj  OA  U  (U> 


V 


OOOOOOOOOOOOOOOOOQ 


*»»•»**  «  *  *«»»**»**«**** 


or  "mo*  to 


(b) 


(a) 


Figure (3) 


68-25 


Re  no .  =  1000 


68-26 


>»3 


00 

c 

l-t 

n 


68-28 


FINAL  REPORT  NUMBER  69 
REPORT  NOT  AVAILABLE  AT  THIS  TIME 
Dr.  Gary  Slater 
760-7M6-08B 


69  -1 


FINAL 
REPORT  NOT 
Dr. 


REPORT  NUMBER  70 
AVAILABLE  AT  THIS  TIME 
Forrest  Thomas 
760-7MG-080 


70-1 


FINAL  REPORT  NUMBER  71 
REPORT  NOT  AVAILABLE  AT  THIS  TIME 
Dr.  William  Wolfe 
760-7MG-102 


71  -1 


FINAL  REPORT 


A  CHEMICAL  KINETICS  MODEL  FOR 
MACH  5  -  14  HYPERSONIC  FLOW 


T.  Gary  Yip  * 


Department  of  Mechanical  Engineering 
SANTA  CLARA  UNIVERSITY 


Santa  Qara,  CA  95053 
408  -  554  -  4937 


Submitted  To 


UNIVERSAL  ENERGY  SYSTEMS,  INC 

4401  Dayton-Xenia  Road 
Dayton,  Ohio  45432 

Contract  No.  F49620-85-C-0013/SB585 1-0360 
Purchase  Order  No.  S-760-6MG-109 

October  5,  1988 


*  Formerly  at  the  Department  of  Aeronautical  &  Astronautical  Engineering, 
The  Ohio  State  University. 


ABSTRACT 


This  study  is  aimed  to  determine  the  adaquacy  of  existing  chemical  kinetics  data  for 
numerical  predictions  of  hypersonic  flow  fields  at  Mach  numbers  up  to  14,  equivalently  at 
temperatures  up  to  10,000  K-  The  one -dimensional  non-equihbrium  flow  field  behind  a 
normal  shock  is  used  here  to  study  the  sensitivity  of  the  flow  properties  to  the  different 
reactions  and  the  uncertainties  in  the  values  of  their  rate  constants.  Known  or  assumed 
factors  of  uncertainty  are  introduced  systematically  into  a  series  of  flow  field  predictions. 
The  results  are  compared  to  obtain  the  sensitivity  information. 

The  higher  rate  of  dissociation  of  O?,  due  to  the  lower  dissociation  energy  of  the 
species,  combined  with  a  factor  of  2  uncertainty  in  the  rate  data  have  been  predicted  to 
cause  as  much  as  1,000  K  deviation  in  the  temperature  profile.  A  similar  degree  of 
deviation  has  also  been  predicted  when  the  dissociation  rate  constant  of  NO  is  assumed 
to  have  a  factor  of  4  uncertainty.  GDntrarily,  the  flow  properties  are  found  to  be  much  less 
sensitive  to  the  larger  uncertainty  in  the  rate  data  for  the  dissociation  of  N2.  These  results 
suggest  that,  for  Mach  14  or  below  hypersonic  flow  fields,  the  reliability  of  the  predicted 
flow  conditions  depends  largely  on  the  accuracy  of  the  O2  and  NO  data  while  the  existing 
uncertainty  in  the  N2  data  is  expected  to  introduce  relatively  insignificant  errors. 

Identification  of  the  dominant  reactions  and  species  in  Mach  14  or  below  flow 
fields  has  resulted  in  two  chemistry  models  which  differ  by  one  reaction.  The  reactions 
common  to  both  models  are  the  dissociations  of  O2,  N2  and  NO,  and  the  shuffle  reactions 
O  4-  N2  ♦*  NO  4-  N  and  N  4-  O2  ♦*  NO  4-  O.  When  the  objective  of  a  numerical 
simulation  of  hypersonic  flow  is  concerned  with  the  presence  of  free  electrons,  the  ionizing 
atomic  collision  N  4-  O  ♦♦  NO'*'  4-  e*  must  be  included  in  the  model. 
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INTRODUCTION 


Numerical  predictions  of  chemically  non-equilibrium  flow  fields  require  an  accurate 
chemistry  model  and  a  set  of  reliable  reaction  rate  data.  For  high  temperature  air,  a 
number  of  models  and  a  wealth  of  rate  data  exist  in  the  literature.  The  resurgence  of 
hypersonic  flow  research  has  prompted  frequent  use  of  these  models  and  rate  data, 
recently  and  in  the  future,  to  predict  the  flow  conditions  about  high  speed  vehicles. 
Unfortunately,  such  flow  predictions  have  been  and  are  expected  to  be  received  with 
considerable  skepticism  about  their  reliability  due  to,  at  least,  the  following  three  reasons. 
The  first  one  is  the  uncertainties  in  the  rate  data.  The  second  is  that  the  rate  data  are 
used  to  predict  flow  fields  at  temperatures  which  are  outside  the  range  in  which  the  data 
were  taken.  The  last  reason  is  the  chemistry  model  may  not  include  all  the  reactions  that 
are  important  to  the  chemistry  of  air  in  the  temperature  range  of  the  predicted  flow  field. 
The  present  study  is  aimed  to  ascertain  the  reliability  of  the  existing  chemical  kinetics 
information  for  the  non-equilibrium  chemistry  of  air  at  temperatures  encountered  in  Mach 
14  and  below  hypersonic  flow  fields. 


SCOPE 


The  primary  objective  of  this  study  is  to  validate  a  chemical  kinetics  model  for  use 
in  predicting  Mach  5-14  hypersonic  chemically  non-equilbrium  flow.  Validating  the  model 
involves  identifying  the  more  dominant  reactions  and  species,  and  determining  the 
sensitivity  of  predicted  flow  fields  to  the  uncertainties  of  the  presently  available  rate  data. 
The  less  important  reactions  and  species  will  be  eliminated.  The  study  is  performed  using 
the  one-dimensional  chemically  non-equilibrium  flow  field  behind  normal  shocks  at  Mach 
numbers  8,  12  and  14.  The  inviscid  flow  governing  equations  and  the  master  rate 
equations  of  the  non-equilibrium  chemistry  are  solved  simultaneously  for  numerical 
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solutions.  Factors  of  uncertainty  in  the  rate  data  are  introduced  and  the  numerical  results 
are  compared  to  determine  the  sensitivity  of  the  flow  variables  to  the  chemistry  model. 

Throughout  this  study  thermal  equilibrium  is  assumed  to  prevail  in  the  predicted 
flow  fields,  hence  the  effects  of  vibrational  non-equilibrium  are  not  included.  While  this 
assumption  is  in  fact  consistent  with  the  experimental  condition  that  the  rate  data  were 
determined  for  thermal  equilibrium,  it  is  intended  for  the  present  sensitivity  study  to  be 
conducted  in  a  marmer  consistent  with  the  source  of  the  rate  data  rather  than  to  reduce 
the  importance  of  vibrational  non-equilibrium  in  a  hypersonic  flow  field. 


FLOW  MODEL 

The  flow  field  considered  in  this  study  begins  with  a  steady  free  stream  flowing  into 
a  stationary  normal  shock.  Chemical  reactions  are  assumed  to  be  absent  from  the  shock 
front  and  the  shocked  gas  at  thermal  equilibrium  behind  the  shock.  The  flow  propenies 
relax  with  the  non-equilibrium  chemistry  as  the  shocked  gas  is  flowing  downstream. 

The  Hugoniot  and  Rayleigh  equations  for  a  normal  shock 


h2  -  h2  =  1/2  PiVj(p2/p2  -  1)(V2/V;  -h  1) 


and  -  =  (P2/Pi  -  l)/(v2/v/  -  1) 

are  solved  for  the  shocked  gas  conditions,  which  are  also  the  initial  conditions  of  the  non¬ 
equilibrium  flow  field.  The  symbol  v  represents  the  specific  volume  and  the  subscripts  1 
and  2  the  pre-  and  post-shock  conditions,  respectively. 


72-4 


The  steady,  chemically  non-equilibrium  flow  is  governed  by  the  following  one¬ 
dimensional  flow  equations; 


Global  Continuity 

du 

dP 

P - 

+  u  -  = 

dx 

dx 

Species  Continuity 

dy,- 

mj  dCj 

dx 

pu  dx 

where  i  =  1, . ,  n 

n  =  number  of  species 
Yl  =  mass  fraction  of  species  i 
Cj  =  concentration  of  species  i 
m,-  =  molecular  mass  of  species  i 


Momentum 


Energy 


dp 

du 

=  -  pu  - 

dx 

dx 

dh 

1 

dp 

=  n 

dx 

P 

dx 

dh 

dT 

dy,- 

— 

— 

+  r  hi  - 

dx 

dx 

dx 

Equation  of  State  p  =  pRT 

where  R  is  the  mixture  specific  gas  constant.  There  are  n+4  equations  and  the  same 
number  of  unknowns,  namely,  y;,  ...  y„,  u,  T,  p  and  p.  The  source  terra  in  each  of  the 
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species  continuity  equation  is  determined  from  a  master  rate  equation  which  accounts  for 
the  production  and  consumption  of  the  species  by  reactions  involving  that  species. 


CHEMISTRY  MODEL 


For  a  free  stream  temperature  of  300  K,  the  temperature  immediately  behind  the 
front  of  a  normal  shock  in  the  Mach  5-14  range  is  below  10,000  K.  In  this  temperature 
range  the  important  reactions  needed  to  properly  model  the  non-equilibrium  chemistry  of 
a  N2-O2  system  such  as  air  are  rather  well  known  and  their  rate  data  are  also  available. 
Furthermore,  most  ionization  processes  are  active  at  such  a  low  level  that  only  the  ionizing 
collision 


N  +  O  NO+  +  e* 

is  responsible  for  the  production  of  the  small  amount  of  free  electrons  in  flow  fields  below 
Mach  14  K  Therefore,  limiting  the  study  to  the  Mach  5  -  14  range  helps  to  reduce  the 
number  of  reactions  to  be  included  in  the  chemistry  model.  This  plus  the  fact  that  the 
reactions  in  the  model  are  quite  well  understood,  except  for  uncertainties  in  the  rate  data, 
ensure  a  reasonable  degree  of  reliability  in  the  sensitivity  results  of  the  present  work. 

Previous  shock  tube  and  chemical  kinetics  studies  suggest  that  the  important 
reactions  in  air  at  temperatures  below  10,000  K  are: 

O2  +  M*»0  +  0  +  M  (1) 

N2  +  M  N  +  N  +  M  (2) 
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and 


N0-hM«*N  +  0  +  M 

(3) 

0  +  N2  NO  +  N 

(4) 

NO  +  0  N  +  O2 

(5) 

N  +  0  NO+  +  e- 

(6) 

where  M  can  be  O2,  N2,  NO,  O  or  N.  Beside  reactions  (3),  (4)  and  (5)  other  reactions 
like 

NO  +  NO**  N2O  +  O  (7) 

and  N2O  +  O  ♦♦  N2  +  O2  (8) 

were  thought  to  have  significant  influence  on  the  overall  kinetics  of  NO.  Counting  the  five 
collision  partners  in  the  three  dissociative  reactions,  a  total  of  twenty  reactions  are  used 
at  the  beginning  of  this  study.  As  will  be  explained  later,  the  species  N2O  and  reactions 
(7)  and  (8)  were  included  in  the  study  initially  and  found  to  have  no  significant  influence 
on  the  overall  chemical  kinetics  in  the  flow  fields  considered  here.  Therefore,  reactions 
(1)  to  (6)  are  sufficient  for  modeling  Mach  8  -14  hypersonic  flow. 

RATE  DATA 

The  motivation  of  this  review  is  to  undwstand  the  limitations  of  and  the 
uncertainties  in  the  existing  chemical  kinetics  information.  The  goal  is  to  obtain  a  set  of 
rate  data  appropriate  for  use  in  the  present  sensitivity  study.  There  is  no  pretense  to 
suggest  that  the  chemical  kinetics  data  mentioned  below  or  those  in  the  open  literature  are 
good  enough  for  hypersonic  flow  research.  Instead,  it  is  the  purpose  of  this  study  to 
determine  which  rate  constants  have  unacceptably  large  uncertainty  and  therefore  requires 
improvement. 
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The  notation  k(A,B)  is  used  below  to  represent  the  dissociation  rate  constant  of 
species  A  when  it  collides  with  species  B.  Here  the  unit  of  concentration  is  moles/cc  and 
temperature  in  Kelvins.  The  rate  constant  expressions  used  in  this  study  are  summarized 
in  Table  1. 


Reaction  ri"):  0;)-fM**04-Q  +  M 

Among  the  aforementioned  reactions,  the  dissociation  of  O2  and  the  reverse  three- 
body  recombination  reaction  are  probably  the  most  studied  reactions.  There  is  a  general 
consensus  on  the  relative  efficiency  of  the  different  collision  partners  in  the  dissociation  of 
oxygen  molecules: 


k(02,0)  *  3  X  k(02.02), 
k(02,02)  =  10  X  k(02,Ar), 
k(02,N2)  =  1/4  X  k(02,02), 

and  k(02,N)  =  k(02,N0)  =  k(02^). 

The  different  rate  constants  can  be  determined  from  k(02yAr). 

Figure  1  shows  the  rate  expressions  of  k(02,Ar)  from  three  sources.  Wray  ^  (1963) 
fitted  data  up  to  18,000  K  to  the  expression 

k(02,Ar)  =  2.5  X  10  T  '  exp(-59,400/T). 

Bortner  (1969)  later  curve  fitted  data  from  nine  studies.  The  resulting  expression  gives 
a  rate  value  that  is  a  factor  of  3  smaller  than  Wray’s  at  10,000  K.  The  difference  between 
the  two  expressions  decreases  as  the  temperature  drops.  The  two  agree  with  one  another 
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at  4,000  K. 


The  most  recent  data  was  reported  by  Breshares  et  al  ^  in  1972.  Using  the  laser 
schlieren  technique,  their  data  has  a  factor  of  two  uncertainty  and  O2  was  found  to  be  9.6 
times  more  effective  than  argon.  According  to  the  review  performed  by  Boyd  and  Bums-^ 
(1982),  the  rate  expression  obtained  by  Breshares  et  al 

k(03Ar)  =  7.87  x  10^^  exp(-52,760/T) 

is  by  far  the  most  reliable  with  a  factor  of  2  uncertainty  and  is  used  in  this  study. 
Reaction  (2\.  N2  +  M**N  +  N  +  M 

Only  the  rate  data  for  N2  +  N2  ♦*  2N  +  N2  and  N2  +  N  «*  3N  are  available  in  the 
literature.  There  is  no  data  for  the  the  collision  partners  NO,  O2  and  O.  The  efficiencies 
of  these  three  species  have  often  been  assumed  to  be  the  same  as  argon. 

For  M  =  N2,  Bortner  (1969)  curve  fitted  the  data  obtained  by  Cary  ^(1965)  and 
Bryon^  (1966)  at  temperatures  above  6,000  K  and  those  recorded  below  1,000  K  by  others. 
The  resulting  expression  agrees  very  well  with  the  one  recommended  by  Wray  ^  (1962). 
Baulch  et  al  ^  (1973)  recommended  the  expression“bbtained  by  Appleton  et  al  (1968) 
which  is  known  to  give  much  lower  values  than  ail  the  other  studies,  including  a  later  study 
by  Hanson  &  Baganoff  (1972).  All  these  expressions  are  plotted  in  Figure  2.  As  a 
result  of  the  higher  temperature  dependence,  the  rate  by  Hanson  et  al  is  higher  than  the 
ones  by  Bortner  and  Wray  and  the  discrepancy  increases  with  decreasing  temperature. 
The  expression  recommended  by  Bortner 

k(N2,N2)  =  5  X  10^«  T-7^  exp(-113,200/T) 
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is  used  in  this  study. 

For  M  =  N,  the  volume  of  data  is  rather  small.  The  rate  expressions  by  Wray* 
(1962),  Bortner  (1969)  and  Hanson  &  Baganoff  (1972)  agree  to  within  a  factor  of  2 
at  temperatures  below  10,000  K  (Figure  3).  The  one  Bortner  obtained  by  curve  fitting 
data  from  four  studies 

k(N2,N)  =  3.0125  X  lO^^  exp(-113,200/T) 


is  used  here. 

There  are  contradictory  reports  on  the  relative  efficiency  between  and  N  in 
causing  the  dissociation  of  N2.  Bryon’s  results  suggest  that  N  is  about  10  times  more 
effective  than  N2  at  9,000  K  and  15  times  at  6,000  K.  Hanson  &  Baganoff  reported  an 
opposing  trend  that  the  relative  efficiency  is  8  at  12,000  K  and  it  drops  to  2  at  6,000  K. 
Appleton’s  results  suggest  a  temperature  independent  ratio  of  4.  Furthermore,  no 
explanation  has  been  given  for  the  large  discrepancy  between  the  data  of  Appleton  et  al 
and  the  other  investigators  as  mentioned  earlier. 

For  M  =  Ar,  the  data  obtained  by  Cary  ^  (1965)  and  Bryon  ^  (1966)  differ  by  less 
'than  a  factor  of  2,  Bortner  curve  fitted  their  data  and  some  other  low  temperature  data. 
The  resulting  expression  yields  rates  very  close  to  those  of  Bryon  (Figure  4).  For  the  6000 
-10,000  K  range,  Bryon’s  expression 

k(N2Ar)  =  1-9  X  10^7  T-ftJ  exp(-113,200/T) 

is  used  here.  For  the  3,390  K  -  6,435  K  range,  Thielen  and  Roth  (1986)  determined 
the  rate  expression 
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k(N2/u-)  =  1.03x102«  exp(-113,220yT) 


which  has  a  different  temperature  dependence  from  Bryon’s.  The  two  expressions  agree 
at  6,210  K-  At  lower  temperatures  the  dissociation  rate  of  N2  is  so  small  that  the 
difference  between  the  two  expressions  is  be  found  in  this  study  to  have  no  significant 
effect  on  the  overall  rate  of  dissociation. 

In  summary,  the  factor  of  uncertainty  in  the  rate  data  for  N2  dissociation  is  a  factor 
of  2  for  the  collision  partner  N  and  about  3  for  the  others. 


Reaction  ('3^:  NO  +  M*»N  +  Q4-M 

Presently,  the  dissociation  rate  data  of  this  reaction  is  extremely  scarce  and  its 
reliability  is  questionable.  Freedman  &  Daiber  (1961)  studied  NO  dissociation  at  3000  - 
4300  K  and  Wray  &  Teare  (1962)  at  3,000  -  8,000  K.  Determining  the  dissociation  rate 
constant  of  NO  is  often  complicated  by  the  simultaneous  presence  of  other  reactions  in  the 
test  gas.  Examples  of  these  reactions  include  NO  +  NO  ♦*  N2O  +  O  and  NO  +  N  N2 
+  O.  To  circumvent  this  problem,  Wray  &  Teare  had  to  assume  the  relative  effectiveness 
of  the  different  collision  partners.  The  rate  constahnc(NO,NO)  was  found  to  be  20  times 
of  k(NO,Ar)  with  the  assumption  that  O  and  N  are  equally  effective  as  NO,  and  O2  and 
N2  as  At  in  causing  the  dissociation  of  NO. 

In  the  present  sensitivity  study,  the  rate  constants  recommended  by  Bortner^ 

k(NO,NO)  =  8  x  10^^  exp(-75,400/T) 

k(NO,N)  =  k(NO,0)  =  k(NO,NO) 
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k(NO,N2)  =  9.0  X  10^“^  exp(-75,400A')  =  k(N0,02) 


are  used.  These  expressions  are  essentially  the  same  as  those  obtained  by  Wray  and 
Teare.  Since  no  other  data  is  available  for  comparison,  the  factor  of  uncertainty  cannot 
be  determined.  For  the  purpose  of  studying  the  sensitivity  of  the  flow  properties  to  this 
reaction,  a  factor  of  4  is  used. 


Reaction  r4h  O  -f  N->  ♦*  NO  +  N.  and  Reaction  fSh  NO  -i-  O  ■>*  N  -f 

The  formation  of  NO  at  moderately  high  temperature  is  primarily  controlled  by  the 
so-called  Zeldovich  mechanism  which  is  made  up  of  these  two  reactions.  Their  rate 
determine  nature  in  combustion  systems  has  prompted  many  studies  of  their  rate  constants. 
By  now  these  rate  constants  are  reasonably  well  characterised. 

Glick  et  al  ^^(1957)  and  Wray  &  Teare  (1962)  determined  the  forward  rate  of 
reaction  (4).  Their  results  were  later  substantiated  by  a  number  of  combustion  studies. 
Baulch  et  al  curve  fitted  all  the  data  to  produce  an  expression  with  a  factor  of  2 
uncertainty  in  the  temperature  range  2,000  -  5,000  K.  Monat,  Hanson  &.  Kruger  (1978) 
used  two  independent  techniques  to  monitor  NO  concentration,  thus  providing  a  check  on 
the  data  and  reducing  experimental  uncertainty.  The  forward  rate  constant  obtained  by 
Monat  et  al 


kf  =  1.84  X  exp(-38384/T) 


is  used.  This  expression  yields  rate  values  close  to  the  upper  values  of  Baulch’s  expression. 


For  the  reaction  NO  +  O  •**  N  +  O2,  Wray  &  Teare  ^  reported  the  only  high 
temperature  data  at  5,000  K-  Hanson  et  al  (1974)  reported  data  in  the  temperature 
range  2,500  -  4,100  K  and  also  a  rate  constant  expression  that  fits  all  the  data  from  300 
to  5,000  K  with  extremely  small  scatter.  Therefore,  the  expression  due  to  Hanson  et  al 
is  used  here: 

kf  =  2.36  X  10  ^  T  exp(-19430/T). 

Presently  there  is  no  data  available  at  temperatures  above  5,000  K. 

Reaction  (6):  NO  +  NO  N^O  +  O  and  Reaction  f7):  N?0  O  **  N->  + 

The  reaction  NO  +  NO  ♦*  N^  +  O2  had  been  mistaken  as  the  primary  NO  removal 
mechanism  until  Camac  and  Feinberg  (1966)  found  in  a  shock  tube  study  of  NO 
decomposition  that  N2O  and  O  were  produced  at  a  rate  compatible  with  the  removal  rate 
of  NO  by  the  reaction  NO  +  NO  ♦*  N2O  +  O.  Other  researchers  re-evaluated  their  data 
and  found  agreement  with  the  results  of  Camac  and  Feinberg.  The  rate  constant 
expression  for  reaction  (6)  derived  by  Kaufman  and  Kelso,  also  recommended  by  Baulch 
is  used  here: 

kf  =  1.3  X  10^2  exp(-32,100/T). 

The  removal  of  N2O  is  known  to  occur  via  the  reactions  N2O  -f-  O  ♦♦  N2  +  O2  and 
N2O  -H  O  ♦*  NO  4*  NO.  The  relative  efficiency  of  the  two  reactions  is  around  unity  at 
temperature  above  1000  K.  The  forward  rate  expression  recommended  by  Baulch  ^ 

kf  =  1.0  X  10^"^  exp(-14,100/T). 
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used  here. 


Leaction  TS"):  N  -t-  O  ♦*  NO"**  +  e- 

-  There  are  different  kinds  of  ionization  processes  that  have  been  known  to  occur  in 
ligh  temperature  air.  The  dominancy  of  the  different  processes  depends  largely  on 
emperature.  For  Mach  5  *  14  flight,  the  available  energy  is  so  small  that  the  ionizing 
itomic  collision 

N  +  O  -  NO+  +  e- 

is  responsible  for  producing  most  of  the  free  electrons  K  This  reaction  requires  less  initial 
ener^  than  the  ionization  potential  of  NO  by  an  amount  released  when  NO  is  formed  in 
the  collision  of  N  and  O.  The  rate  constant 

k;  *  5.181  X  10^^  exp(-31900/T) 

recommended  by  Bortner  ^  is  used  here.  Other  ionic  species  are  not  included  in  the 
present  chemistry  model  because  their  concentrations  have  been  found  to  be  at  least  two 
orders  of  magnitude  smaller  K 

SOLUTIONS 

Numerical  solutions  of  the  1-D  non-equilibrium  flow  field  have  been  obtained  for 
Mach  8,  12  and  14  normal  shock  waves.  The  free  stream  conditions  are  322  K  and  1.216 
mmHg  and  the  composition  of  the  free  stream  air  is  taken  to  be  21%  O2  +  79%  N^  by 
mole. 
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The  governing  equations  of  the  chemically  non-equilibrium  flow  field  are  solved 
using  the  DGEAR  stiff  equation  solver  of  the  IMSL.  The  shock  front  solutions  are  the 
initial  conditions.  Some  of  the  results  are  shown  in  Figures  5-12  and  will  be  examined 
in  detail  in  the  next  section. 


RESULTS  AND  DISCUSSION 


Figures  5,  7  and  9  show  the  composition  profile  behind  normal  shocks  at  Mach  8, 
12  and  14.  The  length  of  the  relaxation  zone  has  been  predicted  to  reduce  significantly 
with  increasing  Mach  number.  It  is  about  20  meters  at  Mach  8,  30  cm  at  Mach  12  and 
5  cm  at  Mach  14. 

At  all  three  Mach  numbers,  the  temperature  of  the  flow  field  is  sufliiciently  high  for 
a  very  significant  degree  of  0^  dissociation  to  occur.  At  Mach  8  the  mole  fraction  of 
atomic  oxygen  increases  from  zero  right  behind  the  shock  front  to  10  percent  at  the  end 
of  the  relaxation  zone.  At  Mach  12  the  final  mole  fraction  of  O  is  25  percent  while  the 
mole  fraction  of  O2  is  less  than  1  percent.  Similar  results  have  been  predicted  in  the 
Mach  14  case.  At  all  three  Mach  numbers,  the  mole  fraction  of  atomic  nitrogen  is  less 
than  or  around  1  %  indicating  the  dissociation  of  is  much  less  significant  than  that  of 

O2. 


Figures  6,  8  and  10  show  the  Mach  number,  temperature,  density  and  pressure 
profiles  in  the  chemically  non-equilibrium  flow  fields.  The  dissociation  reactions  consume 
a  significant  amount  of  the  internal  energy  of  the  shocked  gas  and  cause  the  temperature 
to  drop  by  almost  one  half  of  the  beginning  value.  The  pressure  varies  only  slightly  within 
the  relaxation  zone  while  the  density  increases  by  as  much  as  50  percent. 
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Significance  of  N2O  in  the  Chemistry  of  High  Temperature  Air 

At  Mach  8  and  above,  the  mole  fraction  of  N2O  in  the  relaxation  zone  has  been 
predicted  to  be  very  small.  When  the  two  reactions  involving  this  species  are  excluded 
from  the  chemical  kinetics  model,  no  observable  changes  in  the  concentrations  of  the  other 
species  and  the  flow  properties  can  be  found.  This  is  consistent  with  the  experimental 
results  obtained  by  Myerson  ’  that  reaction  (7)  is  important  for  the  removal  of  NO  only 
at  low  temperatures.  Consequently,  the  reactions 

NO  +  NO  -  N2O  +  O  (7) 

and  N2O  +  O  ♦*  N2  +  O2  (8) 

can  be  eliminated  from  the  chemistry  model,  which  now  consists  of  6  reactions  and  7 
species. 


Significance  of  the  Reaction  N  +  O  -♦  NO"^  +  e‘ 

As  shown  in  Figures  5,  7  and  9,  the  concentr^ions  of  free  electrons  and  NO"'"  ions 
in  the  non-equilibrium  flow  fields  are  small  compared  to  the  concentrations  of  the  neutral 
species.  To  test  the  influence  of  this  reaction  on  the  conditions  of  the  flow  fields,  this 
ionizing  reaction  is  excluded  from  the  chemistry  model.  The  predicted  flow  fields  have 
been  found  to  have  no  observable  changes.  Therefore,  when  the  objective  of  a  numerical 
simulation  of  Mach  14  or  below  hypersonic  flow  does  not  concern  the  presence  of  free 
electrons,  the  chemistry  model  can  further  be  reduced  to  include  only  reactions  (1),  (2), 
(3),  (4)  and  (5),  and  the  species  O2,  Ny,  NO,  O  and  N. 
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Effects  of  Rate  Data  Uncertainties  on  Flow  Conditions 

The  rate  constants  for  the  reactions 

O2  +  M  ♦*  O  +  O  +  M,  N?  +  M»*N  +  N  +  M  and  NO  +  M»*N  +  0  +  M 

have  been  varied  systematically  to  srcdy  the  effects  of  their  uncertainties  on  the  flow 
properties.  The  tests  performed  are  listed  in  Table  II  below. 


V  N2  +  M  =  2N  +  M  3 

VI  N2  +  M  =  2  N  +  M  1/3 

VII  N2  +  N2  =  2N  +  N2“  3 

VIII  N2  +  N2  =  2  N  +  N2  1/3 

DC  N2  +  N  =  2N  +  N  2 

X  N2  +  N  =  2N  +  N  1/2 


XI  NO  +  M-  N  +  O  +  M  4 

XII  NO  +  M  -  N  +  O  +  M  1/4 
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Reaction:  O2  +  M  **  O  +  O  +  M 

The  sensitivity  of  the  flow  field  to  the  factor  of  2  uncertainty  in  the  oxygen  dissociation 
rate  constants  is  examined  in  the  following  manner.  The  flow  field  is  predicted  using  the 
upper  and  lower  values  of  the  rate  constant.  The  results  from  the  two  cases  are  compared 
to  determine  the  effects  on  the  flow  properties  in  the  relaxation  zone.  A  comparison  of 
the  results  of  Cases  I  and  II  in  Table  II  indicates  that  a  factor  of  2  uncertainty  is  predicted 
to  have  significant  influence  on  the  temperature  profile.  The  effect  becomes  larger  as  the 
free  stream  Mach  number  increases  from  8  to  14.  At  Mach  14  the  predicted  temperature 
profile  can  be  off  by  as  much  as  1,000  K,  based  on  a  pre-shock  temperature  of  322  K. 
This  can  be  seen  in  Figure  11.  The  least  affected  is  the  static  pressure. 

Cases  III  and  IV  are  for  examining  the  influence  of  the  less  efficient  collision  partners 
on  the  over-all  dissociation  rate  of  oxygen.  When  the  reactions  O?  -i-  N  **  20  +  N  and 
Oo  -t-  NO  *»  20  4-  NO  are  shut  off,  no  significant  changes  in  the  results  are  found. 
There  are  two  reasons  for  the  insensitivity  of  the  flow  field  to  these  two  reactions.  The 
obvious  one  is  NO  and  N  are  much  less  effective  than  the  other  collision  partners.  A 
more  important  reason  is  that  the  concentrations  of  NO  and  N  are  very  small  in  the  early 
part  of  the  relaxing  flow  field.  The  amount  of  NO  and  N  increase  with  the  distance 
behind  the  shock  front.  By  the  time  they  reach  a  higher  concentration,  the  flow  has 
become  closer  to  equilibrium. 

Reaction:  +  M 

In  Cases  V  through  X,  the  sensitivity  of  flow  properties  to  the  uncertainties  in  the  rate 
contants  for  nitrogen  dissociation  are  tested.  At  Mach  8  and  12,  the  flow  properties  appear 
to  be  unaffected.  At  Mach  14,  the  temperature  profile  is  only  slightly  sensitive  to  the 
uncertainty  as  shown  in  Figure  12. 
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The  results  presented  above  suggest  that  the  thermal  dissociation  of  O?  is  much 
more  dominant  than  that  of  in  hypersonic  flow  fields  below  10,000  K.  Furthermore, 
the  relaxation  of  the  temperature  is  very  sensitive  to  the  uncertainty  factor  of  2  in  the 
existing  rate  data  for  O?  dissociation.  The  reason  for  the  high  sensitivity  is  that  at  the 
beginning  of  the  relaxation  zone  the  composition  is  mainly  Ny  and  O?.  The  competing 
reactions  are  the  dissociations  of  these  two  species  only.  The  smaller  dissociation  energy 
of  Oj,  which  is  about  half  of  that  of  N2,  allows  the  dissociation  of  O?  to  proceed  at  a 
much  higher  rate.  Hence  the  flow  properties  are  more  sensitive  to  this  reaction. 
Consequently,  the  uncertainties  in  the  O2  rate  constants  can  cause  significant  deviations  in 
the  temperature  of  a  predicted  flow  field. 

Reaction:  NO  +  M^N+0  +  M 

The  rapid  production  of  oxygen  atoms  behind  the  shock  front  promotes  the 
production  of  NO  via  the  reactions 


N,  +  O  -*  NO  +  N 

and  N  +  O2  -*  NO  +  O  . 

Note  that  these  reactions  do  not  alter  the  concentration  of  atomic  oxygen.  Figure  13 
shows  the  contributions  of  these  two  reactions  and 

NO  +  M-  N  +  O  +  M 

to  the  change  in  the  mass  fraction  of  NO  with  respect  to  the  distance  behind  the  shock 
front.  The  results  indicate  that  nitric  oxide  is  produced  by  the  two  shuffle  reactions  and 
simultaneously  consumed  in  small  amount  by  dissociation.  The  negative  rate  of  change  of 
the  NO  mass  ''raction  due  to  dissociation  suggests  that  NO  is  not  formed  from  the 
association  of  N  and  O  atoms. 
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Jlie  rate  constants  for  NO  ■¥M**N  +  0-¥M  have  been  varied  by  factors  of  114 
and  4  to  study  the  sensidvity  of  the  flow  properties.  As  already  mentioned  in  the  rate 
constant  review  section,  the  reliability  of  the  existing  NO  dissociation  rate  data  cannot  be 
determined.  The  factor  of  four  used  here  is  an  assumed  factor  of  uncertainty.  Figures  14 
and  15  show  the  sensitivity  of  the  species  mole  fractions  and  flow  properties,  respectively, 
for  a  Mach  14  normal  shock.  Figure  14  indicates  that  a  factor  of  4  increase  in  the 
dissocation  rate  reduces  the  NO  peak  mole  fraction  and  increases  the  concentrations  of 
N  and  O  more  rapidly.  The  higher  concentration  of  N  atoms  at  x  =  0.5  ram  to  1  mm 
enables  the  reaction 


N  +  O2  -  NO  +  O 

to  proceed  at  a  higher  rate.  This  results  in  a  more  rapid  depletion  of  oxygen  molecules 
and  a  higher  equilibrium  mole  fraction  of  NO. 

Figure  15  shows  that  temperature  is  very  sensitive  to  the  factor  of  4  uncertainty  in 
the  rate  constant  of  the  reaction:  NO  +  M-«’N  +  0  +  M.  At  Mach  14  the  predicted 
temperature  profile  deviates  by  as  much  as  930  K,  based  on  a  pre-shock  temperature  of 
322  K-  The  least  affected  is  the  static  pressure.  Figure  15  also  indicates  that  the 
temperature  decreases  at  a  higher  rate  when  the  dissociation  rate  of  NO  is  increased  by 
a  factor  of  4,  This  is  due  to  the  consumption  of  a  larger  amount  of  internal  energy  when 
the  endothermic  dissociation  of  NO  proceeds  at  a  higher  rate. 
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CONCLUSION 


The  reactions  and  species  that  are  important  to  the  non-equilibrium  chemistry  of 
air  at  temperatures  below  10,000  K  have  been  identified  by  examining  the  sensitivity  of  the 
flow  properties  behind  a  normal  shock  to  the  chemistry  model.  By  limiting  the  maximum 
post-shock  temperature  to  10,000  K,  equivalently  the  maximum  Mach  number  of  the  shock 
to  14,  the  reliability  of  the  results  is  ensured  by  using  a  set  of  reaction  rate  data  taken  in 
the  temperature  range  of  the  flow  fields  and  a  chemistry  model  made  up  of  reactions 
known  to  be  important  below  10,000  K  in  air. 

The  uncertainty  factor  of  2  in  the  rate  data  for  oxygen  dissociation  have  been 
predicted  to  introduce  as  much  as  1,000  K  deviation  in  the  temperature  profile.  Similar 
degree  of  deviation  in  the  temperature  has  also  been  predicted  when  an  assumed  factor 
of  4  uncertainty  was  introduced  into  the  dissociation  rate  constant  of  NO.  Although  the 
flow  in  the  present  study  is  one-dimensional,  the  results  suggest  that  comparable  deviations 
in  the  predicted  temperature  may  occur  in  the  fore  portion  of  the  non-equilibrium  flow 
field  between  a  hypersonic  vehicle  and  its  leading  bow  shock. 

As  a  result  of  lower  rate  of  N2  dissociation  below  10,000  K,  tne  flow  properties  are 
much  less  sensitive  to  the  larger  uncertainty  in  the  rate  data  for  this  reaction.  Thus  better 
rate  data  for  oxygen  and  nitric  oxide  dissociations  ^re  needed  for  reliable  predictions  of 
hypersonic  flow  while  the  existing  uncertainty  in  the  nitrogen  data  is  expected  to  introduce 
insignificant  deviations. 

Out  of  the  initial  twenty  reactions  and  eight  species  in  the  chemistry  model,  eighteen 
reactions  and  seven  species  have  been  identified  to  be  essential  to  the  chemistry  of  air  at 
temperatures  below  10,000  K.  One  of  the  reactions  is  the  ionizing  atomic  collision  N  + 
O  ♦*  NO'*’  -t-  e*.  This  reaction  and  the  two  charged  species  can  be  eliminated  when  the 
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objective  of  a  numerical  simulation  of  hypersonic  flow  is  not  concerned  with  the  presence 
of  free  electrons. 

Compression  or  expansion  of  the  flow  and  vibrational  non-equilibrium  effects  have 
been  neglected  in  this  study.  Any  attempt  to  extrapolate  the  present  results  to  other  flow 
fields  should  be  done  with  careful  consideration  of  these  factors. 
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TABLE  I 


RATE  CONSTANTS 


Reaction 

M 

Sources  of 

Forward  Rates  Forward 

Rates 

Equilibrium  Constants 

(1) 

O, 

O 

N^ 

NO,  N 

7.6xl(y‘^exp(.52,760/T) 

9.6  X  k(M=Oo) 

0.25  X  k(M=Oo) 
7.87xl0^^exp(-52,760yT) 

See 

Text 

7.24xl02^7-^exp(-59,400A') 

(2) 

No 

n' 

N0,N,0 

SxlO^^T--  7^exp(-113,200/T) 
3xl02^-^"yexp(-l 13,200/T) 
1.9x10^  7T--^exp(-113,200/T) 

See 

Text 

I08xl0^^exp(-U3,200/T) 

(3) 

Oo,N2 

N,0,NO 

9xlO^‘^exp(-75,400/T) 

8xl02^T-^"^exp(-75,400/T) 

Wray  & 
Teare 
(1962) 

1.45x10^-^  exp(-75,400/T) 

(4) 

1.84xl(y‘^exp(-38, 38477) 

Monat 
"“et  a] 
(1978) 

4.545  exp(-38,000/T) 

(5) 

2.63x10^  exp(-19,430/T) 

Hanson 
et  al 
(1974) 

2.4x10*^  exp(-16,l00/T) 

(6) 

5.18x10^  ■^exp(-31,900/T) 

Bortner 

(1969) 

3.58xl(y^^"^exp(-31,900/T) 

Concentration  in  moles/cc,  Temperature  in  Kelvins 
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Figure  2  ;  Comparison  of  Rate  Constants  for  N2  +  -►  N  +  N  +  N? 
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Figure  5  :  Species  Mole  Fractions  behind  a  Mach  8  Normal  Shock. 
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Figure  7  :  Species  Mole  Fractions  behind  a  Mach  12  Normal  Shock, 
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Figure  9  ;  Species  Mole  Fractions  behind  a  Mach  14  Normal  Shock. 
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Figure  10  :  Profiles  of  Flow  Properties  behind  a  Mach  14  Normal  Shock. 
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BHO/RHOl  MflCH  NUMBER 


Figure  13  :  Change  of  NO  Mass  Fraction  due  to  Different  Reactions  at  M  =  14. 
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- and  kj,  x  1/4 

-  and  kj,  X  4 

Figure  14  :  Deviations  in  the  Predicted  Species  Mole  Fractions  due  to 
a  factor  of  4  uncertainty  in  the  Rate  Data  of 
NO  -f-M«*N-f-0-t-M  for  a  Mach  14  Normal  Shock. 
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Figure  15  :  Effects  of  the  Uncertainty  in  the  Rate  Constant  of 


NO  +  M**N  +  0  +  M  on  the  Flow  Properties  at  M 
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ABSTRACT 


A  simulation  model  of  lateral  resupply  of  spare  parts  in  a 
multi-echelon  inventory  system  was  developed  in  order  to 
answer  two  major  questions.  What  is  the  effect  of  any 
resupply  policy  on  backorders  at  the  bases?  What  is  the 
optimal  lateral  resupply  policy?  Emphasis  was  placed  on  the 
latter  question. 
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I. INTRODUCTION: 


The  Directorate  of  Management  Sciences  (XRS)  of  the  Air  Force 
Logistics  Command  (AFLC)  conducts  research  on  various  issues 
in  logistics.  In  general,  XRS  relates  logistics  decisions  to 
peacetime  readiness  and  wartime  sustainability.  One  area  in 
which  XRS  has  become  expert  is  in  using,  modifying  and 
developing  methods  and  models  which  are  useful  for  recover¬ 
able  items.  These  are  items  which  are  repaired  rather  than 
discarded  upon  failure.  A  major  focus  of  the  Assessment 
Applications  Division  (XRSA)  is  on  the  determination  of  the 
capability  of  the  Air  Force  to  fly  its  planned  missions  as  a 
function  of  the  spare  (recoverable)  parts  and  the  repair 
policies  used  for  spare  recoverables .  A  major  tool  which  is 
used  to  provide  this  assessment  is  AFLC's  Weapons  Systems 
Management  Information  Systems  (WSMIS)  and  a  major  component 
of  WSMIS  is  dyna-METRIC.  (dynamic  multi-echelon  technique  for 
recoverable  item  control,  see  Hillestad ( 1982 )  and 
Pyles(1984) )  . 


My  expertise  is  in  management  science/ operations  research.  In 
particular  I  have  done  a  great  deal  of  research  on  inventory 
systems,  queuing  systems  and  scheduling  systems.  The  recover¬ 
able  item  inventories  are  a  hybrid  of  inventory,  queuing  and 
scheduling  systems  in  that  spares  (the  inventory  portion  of 
the  hybrid)  are  provided  and  failed  parts  are  queued  or 
scheduled  for  repair  (rather  than  being  discarded) .  Thus, 
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there  is  a  good  fit  between  my  inventory/queueing/scheduling 
expertise  and  the  problems,  methods  and  models  of  XRS,  in 
particular,  with  respect  to  recoverable  items. 

II.  OBJECTIVES  OF  THE  RESEARCH  EFFORT: 

The  particular  problem  that  I  have  worked  on  is  the  problem 
of  lateral  resupply  among  bases.  For  the  most  part,  the 
current  models  (dyna-METRIC  and  WSMIS)  do  not  assume  that  it 
is  possible  for  bases  to  supply  each  other  with  spare  parts. 
In  the  current  models  all  spares  move  up  or  down  the  echelons 
from  base  to  depot  and  from  depot  to  base.  In  reality,  parts 
do  move  within  an  echelon  and  this  leads  to  two  different 
problems.  The  first  problem  is  to  determine  the  best  way  to 
move  these  parts  (i.e.  determine  the  optimal  lateral  resupply 
policy)  and  the  second  problem  is  to  assess  the  effect  of  any 
lateral  resupply  policy  on  mission  capability.  Slay  (1986) 
has  worked  in  the  latter  area  but  other  than  for  this  paper 
there  has  been  very  little  research  in  the  area  of  lateral 
resupply.  My  main  concern  is  with  the  former  problem. 

III.  MULTI  ECHELON  INVENTORY  SYSTEMS: 

One  general  concern  at  XRSA  is  with  multi-echelon  inventory 
systems.  The  major  system  used  to  model  the  multi-echelon 
inventory  systems  is  WSMIS.  Items  arrive  at  the  bases  in 
need  of  repair.  If  the  base  has  spare  items  it  will  provide 
these  items  to  the  customer.  If  the  base  has  no  spares  then 


the  customer  waits  for  either  a  resupply  from  the  depot  or  a 
repair  at  the  base.  In  either  case  the  item  itself  is  placed 
in  repair  either  at  the  base  or  the  depot.  While  this  is  the 
representation  of  the  inventory  system  as  expressed  in  WSMIS 
this  representation  does  not  allow  for  the  movement  of  spares 
within  a  single  echelon  in  the  inventory  system.  That  is, 
WSMIS  does  not  allow  for  the  movement  of  spares  from  base  to 
base  or  for  that  matter  from  depot  to  depot.  This  movement 
can  occur  and  it  is  precisely  this  intra-echelon  movement  of 
spare  parts  which  has  been  the  focus  of  my  summer  research. 

The  system  which  needs  to  be  considered  is  one  as 
follows.  Items  can  move  up  and  down  echelons  (by  moving 
between  the  bases  and  the  depot  as  modelled  by  WSMIS)  and  in 
addition  items  can  move  within  the  same  echelon  (by  moving 
between  bases.)  The  analysis  which  is  necessary  is  two-fold. 
One  area  of  research  is  to  find  the  optimal  policy  for  the 
lateral  resupply  of  parts.  The  second  area  is  to  perform  the 
analysis  in  a  fashion  similar  to  Dyna-METRIC.  That  is,  given 
a  set  of  rules  which  govern  the  lateral  resupply  find  the 
impact  on  readiness  and  sustainability  of  various  lateral 
resupply  policies.  The  latter  has  been  performed  to  some 
extent  by  Slay  (1986).  My  concern  is  primarily  with  the 
former  -  finding  the  optimal  lateral  resupply  policy. 
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IV.  THE  GENERAL  DESCRIPTION  OF  LATERAL  RESUPPLY: 


In  this  section  we  describe  the  lateral  resupply  in  opera¬ 
tions  research  terms.  The  lateral  resupply  optimization  prob¬ 
lem  is  a  dynamic  program.  Unfortunately,  as  we  will  discuss 
shortly  the  state  space  for  this  dynamic  program  is  much  too 
large  to  yield  any  useful  results.  Therefore  ultimately,  to 
provide  useful  rules  I  have  written  a  simulation  which  will 
model  lateral  resupply. 

We  make  the  following  definitions: 

M  =  the  number  of  bases 

T  =  the  length  of  time  of  interest  (in  days) 

S(i,t)  =  the  serviceable  stock  at  base  i  =  1,2, ...,M  on 
day  t=l,2, . . . ,T. 

X(i,j)  =  the  mean  length  of  time  to  transport  the  parts 
from  base  i  =1,2,...,M  to  base  j?*i.  As  a  general  rule  this 
length  of  time  is  deterministic.  It  takes  1  day  to  ship  parts 
from  an  onshore  base  to  an  onshore  base  or  from  an  offshore 
base  to  an  offshore  base  and  3  days  to  ship  parts  from  an 
onshore  base  to  an  offshore  base. 

r  =  mean  repair  time. 

D(i,t)  =  the  number  of  items  due  in  at  base  i  on  day  t. 
These  items  consist  of  those  which  will  be  repaired  as  well 
as  those  which  are  arriving  due  to  a  lateral  resupply  from 
some  other  base. 

The  decision  which  needs  to  be  made  on  any  given  day  t  is 
denoted  by  a  matrix  d(i,j)  where  d(i,j)  is  the  number  of 
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units  shipped  from  base  i  to  base  j  .  We  now  can  describe  the 
state  space  for  the  decision  model.  On  any  day  t  the  state  is 
described  as 

state=(S (i, t) ,i=l,2, . . . ,M;D(i,z) ,i=l,2, . . . ,M:z=t+l,t+2, . . . ,T) 

That  is,  the  information  which  is  relevant  to  decision  making 
is  the  number  of  units  in  stock  at  each  base  and  the  number 
of  units  that  are  going  to  return  to  the  base  from  the  pipe¬ 
line  in  future  days.  This  is  an  unwieldy  state  space  for  even 
the  smallest  problems.  Considering  the  size  of  the  state 
space  it  seems  very  unlikely  that  an  approach  such  as  dynamic 
programming  can  ever  find  the  solution.  The  problem  would 
have  to  be  very  small  for  DP  to  work.  This  leads  to  two 
possibilities.  One  possibility  is  to  simulate  the  system.  The 
second  possibility  is  to  collapse  the  state  space.  This 
second  option  has  been  used  in  perishable  inventory  where  21 
separate  days  worth  of  information  are  collapsed  into  their 
sum.  (see  Nahmias ( 1975) ) .  The  collapsing  approach  seems 
unreasonable  for  the  problem  of  lateral  resupply  because  the 
structure  of  the  pipeline  is  critical  in  determining  whether 
or  not  items  should  be  shipped  from  one  base  to  another.  We 
think  that  by  collapsing  the  state  space  the  distinctions 
between  near  and  far  pipeline  items  will  be  lost  and  ineffec¬ 
tive  policies  will  be  developed.  Even  if  this  is  not  the  case 
the  state  space  would  still  be  rather  large  for  the  size  of 
problem  that  must  ultimately  be  considered.  Therefore  we 
opted  for  the  simulation  system. 
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V.  DESCRIPTION 


Because  an  analysis  or  optimization  technique  is  infeasible 
it  is  necessary  to  use  a  simulation  to  determine  the  effects 
of  policy  changes  in  lateral  resupply  on  backorders.  Simula¬ 
tion  will  not  yield  the  optimal  lateral  resupply  policy  but  a 
properly  written  and  properly  used  simulation  should  lead  us 
in  the  direction  of  the  optimal  policy. 

The  first  question  was  whether  the  simulation  should  be 
designed  for  a  micro  computer  or  a  main  frame.  Because  the 
goal  of  the  simulation  is  to  test  various  lateral  resupply 
operating  rules  the  simulation  must  be  convenient  to  use  and 
user  friendly.  This  is  most  easily  accomplished  if  the 
simulation  is  written  for  a  micro  computer.  One  concern  which 
we  had  with  using  a  micro  computer  was  that  a  simulation 
program  would  take  too  much  (real)  time.  However,  this  has 
not  been  the  case  and  real  time  has  not  posed  a  problem.  One 
problem  that  arises  is  the  fact  that  the  number  of  variables 
is  more  limited  on  a  micro-computer  than  on  a  main-frame. 
However,  keeping  in  mind  that  the  objective  of  the  simulation 
is  to  play  with  the  policies  in  order  to  determine  useful 
rules,  all  runs  have  been  kept  relatively  small  anyway  and 
this  was  not  an  issue.  (We  do  feel  that  after  the  policies 
are  developed  they  should  be  tested  on  a  large  scale  and  this 
is  discussed  at  the  end  of  this  section.) 

The  simulation  system  consists  of  three  major  portions.  The 
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first  part  is  an  editing  system  which  allows  quick  and  easy 
editing  of  both  the  data  and  the  lateral  resupply  rules  which 
are  used  as  input  to  the  simulation.  The  second  part  is  the 
actual  simulation  itself  while  the  third  part  is  a  procedure 
for  taking  the  output  from  the  simulation  and  turning  the 
output  into  useful  reports. 

The  specific  data  which  must  be  input  to  the  simulation  are 
in  accompanying  documention  and  user's  guide. 

The  difficult  part  of  the  simulation  was  setting  up  the  data 
base  and  simulation  program  to  handle  a  wide  variety  of 
rules.  The  rules  were  organized  in  order  that  bases  which 
need  units  will  be  given  some  priority  measure  and  that  bases 
that  have  excess  units  will  be  given  some  measure.  The  two 
are  obviously  related  since  the  closer  that  bases 
are  to  one  another  the  more  likely  it  is  that  a  lateral 
resupply  will  be  useful.  This  is  the  part  of  the  simulation 
which  required  the  greatest  thought  in  order  to  make  the 
simulation  useful. 

The  output  reports  are  described  in  the  accompanying 
documentation  and  user's  guide. 

The  long  term  lateral  resupply  simulation  system  which  we 
envision  is  as  follows.  Two  versions  should  exist  as  it 
should  be  possible  to  perform  quick  experiments  on  a  micro¬ 
computer  and  large  experiments  on  a  mainframe  computer.  There 
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should  be  a  linking  between  the  data  sets  so  that  any  data 
set  can  be  used  on  either  type  of  computer.  This  linking  is 
easy  to  achieve  with  the  use  of  modems.  Similarly  output 
reports  from  either  system  should  be  capable  of  being  pro¬ 
duced  by  either  a  main  frame  or  microcomputer.  Again,  this  is 
easy  to  accomplish  with  the  help  of  a  modem.  In  addition  we 
envision  a  linking  of  WSMIS  runs  and  the  lateral  resupply 
simulations.  It  is  possible  that  WSMIS  runs  will  generate  the 
problem  parts  which  reguire  further  study  for  lateral  resup¬ 
ply  policies.  One  additional  advantage  of  having  the  system 
on  two  types  of  machines  is  that  the  accuracy  of  the  results 
can  be  checked  by  running  the  same  data  on  two  different 
machines. 
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ABSIRACT 

A  project  to  develop  hardware  and  software  neces£:xy  to  control  a  carbon 
fiber  spinning  system  using  expert  system  control  was  completed.  The 
equipment  necessary  to  ccaivert  the  process  to  computer  control  was  acquired 
and  tested  in  the  fiber  spinning  system.  Additicml  software  necessary  to 
control  the  new  equipment  was  developed  and  tested.  Limited  scale  testing  of 
fiber  spinning  was  accomplished,  and  additional  equipment  needs  were 
identified. 
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I .  INTRODUCTION 


Ihe  developnent  of  a  microconiputer  based  system  for  control  of  carbon 
fiber  spinning  from  pitches  is  of  great  use,  because  it  allows  spinning  of  a 
wider  variety  of  carbon  fibers,  and  at  the  same  time  allows  for  the  natural 
variability  of  pitch  feedstocte.  Conv^rtional  analog  control  of  the  process 
provides  less  than  optimal  control  of  the  process  and  can  not  easily  respond 
to  variability  in  feedstock.  The  major  effort  in  this  project  was  in  the 
construction  and  testing  of  control  units  and  sensors  vAiich  were  connected  to 
a  microcomputer.  Thus,  a  great  deal  of  this  report  will  deal  with  equipment 
selection,  construction,  calibraticai,  and  troubleshooting. 


II.  OBJECTIVES  OF  THE  RESEARCH  EFFORT 

The  objectives  of  the  project  included  the  construction  and  testing  of  an 
the  hardware,  as  well  as  the  development  of  software,  for  the  expert  system 
control  of  carbon  fiber  spinning.  The  hardware  chosen  was  to  be  computer 
controllable,  and  any  necessary  interface  hardware  and  software  was  to  be 
produced.  The  original  fiber  spinning  equipment  did  not  have  the  capability 
of  computer  interfacing,  so  selection  of  equipment  for  this  purpose  was 
important  to  the  project.  In  evaxuating  the  suitability  of  the  selected 
equipment,  one  of  the  objectives  of  the  project  was  determination  of  the 
interdependence  of  the  control  parameters,  a..d  their  effects  upon  the  quality 
of  the  spun  fibers.  Fiber  spinning  runs  were  to  be  made,  and  the  suitability 
of  the  control  devices  and  mechanical  equipnent  selected  were  to  be  evaluated. 

III.  HARDWARE  SELBCnOI  AND  DEVELOPMENT 

The  first  phase  of  the  project  was  the  selection  of  hardware  necessary  to 
convert  existing  fiber  spinning  equipment  to  allow  computer  control,  and  also 
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the  acquisition  of  the  computer  control  equipment.  The  computer  equipment 
chosen  was  a  system  siioilar  to  that  used  in  the  developinent  of  the  Air  Force 
QPA  system  (Leclair  et  al.,  1987).  The  main  microccmpjter  computer  components 
are  IBM  AT  compatible  computer,  a  Data  Translation  2805  A/D-D/A  board,  an 
Omega  2010  tenperature  controller,  and  a  Mototrolla  68000  coprocessor  board. 
Recent  modifications  to  the  QPA  expert  system  shell  have  allowed  the  use  of  a 
single  68000  board,  ax>d  with  scane  siiiple  modifications  to  the  cxxie,  one  board 
has  appeared  to  be  adequate  at  present.  A  second  68000  board  has  been 
acquired  should  the  single  board  system  prove  inadequate  in  the  future. 

A  schematic  diagram  of  the  spinning  system  modified  for  conpiter  control 
is  shown  in  Figure  1  (see  Appendix  I) .  The  system  was  modified  to  accommodate 
a  screw  feed  and  a  motor  driven  pump  drive.  This  allows  direct  ccxitrol  of 
feed  rate  and  nekes  the  system  essentially  a  ccMitinuous  operation.  This  type 
of  spinning  (known  as  screw  melt  spinning)  requires  that  the  feed  pitch  be 
preprocessed  to  bring  it  to  mesophase,  and  then  cooled  and  ground  to  allow  dry 
feeding  in  the  screw  feed.  The  m6tors  which  drive  the  screw  feed  and  melt 
punp  are  of  the  same  size  and  were  chosen  to  provide  motor  speed  on  the  order 
of  10  rpm.  Bodine  motors  were  chosen,  because  manufacturer's  specifications 
indicated  that  both  motor  spe^  and  torque  could  be  cx>ntrol].ed  or  sensed.  The 
original  intent  was  that  the  motor  speed  be  controlled  and  the  torque  be 
sensed.  It  was  found  however  that  U-s,  torque  could  not  be  sensed  when  speed 
was  controlled,  because  the  speed  control  degraded  the  torque  signal.  Thus 
the  motors  required  either  speed  or  torque  control,  but  the  two  modes  could 
not  be  run  simultaneously.  A  third  motor  of  higher  speed  rating  is  connected 
to  the  wind  up  drum.  Speed  control  is  ijraportant  for  the  wind  up,  however  the 
tension  in  draw  line  is  controlled  by  the  torque  supplied  by  the  motor.  Speed 
control  was  chosen,  since  the  same  mechanisms  of  control  could  be  used  as  for 
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the  other  motors,  and  because  speed  control  can  be  successful  as  long  as 
process  conditions  do  not  change  rapidly. 

With  the  loss  of  torque  measurement  ability,  the  system  could  not  be 
equij^)ed  to  sense  any  measure  of  the  pitch  viscosity  at  the  melt-spin  stage. 
Examination  of  the  possibility  of  in-line  viscosity  determination  with  orifice 
or  Poiseulle  tube  devices  led  to  the  conclvision  that  they  could  not  readily  be 
implemented  without  seriously  reducing  the  quality  of  the  product  as  well  as 
control  of  the  process. 

Bodine  motor  controllers  were  chosen  for  all  of  the  motors.  These 
controllers  are  PID  based,  and  are  matched  to  the  motors  to  provide  smooth, 
acoorate  control.  Motor  control  conetjands  are  sent  throu^  the  DT  2805  board 
as  digital  signals.  A  digital  routing  network  was  designed  and  oOTistructed  to 
allow  the  available  16  bits  of  digital  output  from  the  CfT  2805  to  control  all 
three  motors  with  sequential  signals.  A  schematic  diagram  of  the  routing 
network  is  given  in  Figure  2.  Ten  of  the  sixteen  bits  of  a  digital  write  are 
used  to  set  the  fraction  of  total  output  (  zero  to  1023,  with  1023  as  full 
output) .  The  other  six  bits  are  used  to  signal  a  speed  change  and  to  choose 
the  device  to  be  controlled. 

The  motors  were  calibrated  to  relate  the  speed  to  the  ten  bit  control 
signal  sent  by  the  routing  network.  Figures  3  and  4  show  the  curves  for 
motors  A  and  B  which  are  used  by  the  screw  feed  and  punp.  Figure  5  shews  both 
curves  plotted  on  the  same  graph,  and  it  appears  that  they  respond  according 
to  the  same  cxirve.  When  the  variation  of  the  cciubined  curves  is  calculated 
and  cempared  to  a  fourth  order  polynomial  fit  to  the  data,  the  control  is 
found  to  be  acciorate  to  0.5%  as  compared  to  0.1%  suggested  by  the  10  bit 
control.  Seme  of  the  variation  can,  however  be  attributed  to  inaccuracy  in 
the  measvtring  the  speed  at  the  lew  rates  of  these  motors.  The  speeds  could 
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not  be  measured  by  tachometer,  because  the  speeds  were  on  the  order  of  one  to 
ten  rpm.  The  calibration  curve  of  the  higher  speed  windup  motor  is  shown  in 
Figvire  6.  The  apparent  jump  in  the  curve  at  the  lower  speed  is  due  to 
switching  from  manual  rotation  measurement  to  tachometer.  It  is  not  likely 
that  the  motor  will  run  for  significant  times  at  these  low  speeds  anyway.  It 
is  important  to  note  that  all  of  the  motors  have  a  low  end  clipping  of  input, 
so  that  the  lowest  few  per  cent  of  speed  control  is  not  available.  This 
should  not  cause  significant  problems. 

The  other  controlled  device  is  a  heater  in  the  melt  chamber  between  the 
screw  feed  and  the  pump.  The  heater  is  controlled  by  an  Omega  CN2010 
tenperature  controller.  Ihe  CN2010  is  equipped  with  an  RS232  interface  and 
an  internal  microprocessor  to  direct  sensing  and  control  through  the  serial 
port  as  ASCII  code  signal  sequences.  Temperature  is  controlled  to  the  nearest 
degree  over  a  range  of  60  to  500  Celsius. 

Pictures  of  the  hardware  of  the  spinning  system  are  shown  in  Figures  7- 
10.  Figxire  7  shows  the  spin-melt  and  hot  box  portions  of  the  top  of  the 
ajparatus,  and  Figure  8  shows  the  bottom  half  of  the  hot  box  and  the  windup 
drum  with  its  motor.  Figure  9  includes  the  component  parts  of  the  spinneret, 
and  Figure  shows  the  digital  routing  network  (gray  box)  and  motors  connected 
to  computers.  The  controllers  are  the  partially  obscured  black  boxes  to  the 
left  of  the  motors. 

IV.  SOFTWARE  DEVELOFMEMjl' 

Since  the  QPA  system  is  written  in  FORTH  language,  new  control  words  for 
the  new  devices  were  written  in  FORTH.  All  of  the  changes  needed  for  these 
control  words  were  made  in  the  source  file  originally  called  AUT.SRC.  The 
source  file  was  renamed  to  DEVICES. SRC,  because  an  autoclave  was  no  longer  the 
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control  device.  Some  words  remain  frcm  the  original  source  code,  and  the  new 
code  has  been  documented  to  aid  in  londer standing  of  the  logic  used.  The 
protocol  used  for  the  control  words  used  to  control  the  CN  2010  is  determined 
by  its  internal  processor,  and  so  the  manual  for  its  operatiOT  was  used  to 
determine  the  proper  ASCII  sequences  for  reading  temperature  and  writing 
set  points.  The  protocol  for  the  words  sent  to  the  digital  routing  networic 
for  the  motors  is  arbitrary,  although  it  is  determined  to  some  degree  by  the 
D/A  chips  used  to  convert  the  digital  signal  to  a  form  acceptable  by  the 
controllers. 

All  of  tlje  original  PCRIH  words  used  to  direct  the  DT  2805  D/A-A/D  board 
in  the  original  QPA  iirplementation  for  autoclave  curing  of  composites  also 
appear  in  DEVICES. SRC.  Several  new  words  have  been  added  to  xitilize  the 
digital  write  capability  of  the  board  in  the  control  of  the  motors.  Since 
testing  thus  far  has  only  utilized  manual  control,  the  words  necessary  for 
this  control  have  also  been  included. 

The  project  did  not  progress  to  the  point  where  an  ejqjert  system  rule 
base  could  be  constructed.  Manual  control  runs  were  performed  using  the 
constructed  system,  and  ongoing  work  beyond  the  time  frame  of  this  project  is 
continuing  tcward  the  development  of  a  woricable  knowledge  base. 

V.  REOCMMENDa^IONS 

The  difficulties  encountered  with  the  inability  to  sense  and  control  both 
speed  and  torque  control  reduced  the  ability  of  the  system  to  sense  OTie  of  the 
key  parameters  of  fiber  spinning  with  mesophase  pitch.  It  is  clear  that  some 
form  of  viscosity  measurement  is  necessary  for  successful  expert  system 
control.  The  most  easily  achievable  technique  >hich  could  be  employed  is  the 
indirect  sensing  of  torque  on  the  spinning  pump.  Plans  are  being  drawn  to  add 
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torque  sensing  devices  on  the  shafts  of  the  spinning  and  wind  up  motors. 
These  will  allow  pitch  viscosity  sensing  and  draw  tension  respectively. 
Furthermore  idler  tension  control  is  also  being  examined  for  use  in 
conjunction  with  torque  measurement. 

At  present,  no  in  situ  viscosity  measurement  device  is  commercially 
available  for  moving  fluids  at  the  temperatures  experienced  in  this  setting. 
Should  such  a  device  become  available,  it  would  be  clearly  preferable  and 
should  be  installed  in  the  system.  Along  the  same  lines,  research  into  the 
develonsant  of  this  kind  of  device  is  reccsmnended.  Its  ^;plicability  reaches 
into  all  applications  dealing  with  mesophase  pitch,  one  of  the  more  obvious 
being  in  the  processing  of  carton-carbon  materials. 

It  is  further  reocnmended  that  temperature  within  the  hotbox  be  added  as 
one  of  the  controlled  variables.  This  should  be  done,  however  only  after 
many  rxms  have  been  made  withcwt  the  control  for  a  basis  of  comparison.  The 
same  CN  2010  controller  could  be  used  in  this  temperature  control  as  well  as 
the  melt  temperature,  because  the  controller  is  equipped  for  control  of  two 
devices.  Hi^  intensity  limits  could  be  used  as  heaters  to  avoid  ignition  of 
volatile  fumes. 

Finally,  the  expert  system  rule  base  should  be  constructed  and  tested. 
Work  is  continuing  in  this  direction  under  internal  funding  of  MaterieLLs 
Technol^^gy  Center  of  Southern  Illinois  IMiversity  at  Caitondale. 
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Figure  2.  Digital  Routing  Network 
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>>c 


create  (devices) 

cr  .(  loading  DEVICES. SRC  ) 

\ 

\  12-88  Modified  version  of  AUT.SRC  for  fiber  spinning 

\ 

\ - DT2805  A/D  D/A 

CR  . (  loading  DT2805  support  . .  ) 

\  DT2805  board  parameters: 

HEX 

02EC  constant  BASE.ADDR  BASE.ADDR  1+  constant  DT.CMND.REG 

BASE.ADDR  constant  DT. DATA. REG  BASE.ADDR  1+  constant  DT.STAT.REG 

\  DT2805  BOARD  COMMAND  CODE: 

04  constant  CMND.WAIT  02  constant  WRITE. WAIT  05  constant  READ. WAIT 

OF  constant  CSTOP  01  constant  CCLEAR  02  constant  CERROR 

03  constant  CCLOCK  OD  constant  CSAD  OE  constant  CRAD 

10  constant  CDMA  0  5  constant  DUMMY  05  constant  CSOL'T 

OC  constant  CADIN  \  SINGLE  A/D  CONVERSION 

08  constant  CDAOUT  \  SINGLE  D/A  CONVERSION 

07  constant  CDIOOUT  \  OUTPUT  DIGITAL  BYTE 

DECIMAL 

:  BINARY  2  BASE  !  ; 

\  DT2805  USER-SELECT  PARAMETERS: 

0  constant  GAIN. CODE  (  GAIN  IS  SET  TO  1  ) 

HEX 

:  CHECK. DT. STATUS  (  —  TF  ) 

DT.STAT.REG  PC@  70  AND  0=  NOT  DUP 

IF  CR  ."  fatal  error  -  illegal  DT. STATUS  register  ” 

CR  DT. STATUS  register  value  is  "  DT.STAT.REG  PC:§  . 

THEN  ; 

DECIMAL 

:  WAIT  (  N  —  )  BEGIN  DT.STAT.REG  PC@  OVER  AND  UNTIL  DROP  ; 

:  >DT.CMND.REG  (  N  —  )  CMND.WAIT  WAIT  DT.CMND.REG  PC:  ; 

:  >DT. DATA. REG  (  N  —  ) 

WRITE. WAIT  WRITE. WAIT 

BEGIN  SWAP  >R  DT.STAT.REG  PC@  OVER  XOR  R§  AND  R>  -ROT  UNTIL 
DROP  DROP  DT. DATA. REG  PCI  ; 

:  <DT. DATA. REG  (  —  N  )  READ. WAIT  WAIT  DT. DATA. REG  PC@  ; 

:  <DT.STAT.REG  (  —  N  )  CMND.WAIT  WAIT  DT.STAT.REG  PC@  ; 

:  DTSTOP  (  —  )  CSTOP  DT.CMND.REG  PCI  DT. DATA. REG  PC§  DROP  ; 

:  DTCLEAR  (  —  )  CCLEAR  >DT.CMND.REG  ; 
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;  SETFOR-DOUT  (  —  )  CSOUT  >DT.CMND.REG  ; 

:  SETUP-A/D  (  —  )  CADIN  >DT.CMND.REG  ; 

;  SETUP-D/A  (  —  )  CDAOUT  >DT.CMND.R£G  ; 

:  SETUP-DOUT  (  —  )  CDIOOUT  >DT.CMND.REG  ; 

:  SEL-CHANNEL  (  N  —  )  >DT. DATA. REG  : 

:  A/D-GAIN  (  —  )  GAIN. CODE  >DT. DATA. REG  ; 

;  DO-A/D  (  —  N  )  <'DT. DATA. REG  <DT. DATA. REG  256  *  +  ; 

:  DO-D/A  (  N  —  )  256  /MOD  SWAP  >DT. DATA. REG  >DT. DATA. REG 
;  DO-DOUT  (  N —  )  >DT. DATA. REG  ; 

HEX 

VARIABLE  DT. STATUS  VARIABLE  ERRORl  VARIABLE  ERROR! 

:  ?DT. ERROR  (  —  TF  )  <DT.STAT.REG  DUP  DT. STATUS  !  80  AND 

:  GET-DT-ERR  (  —  ) 

DTSTOP  CERROR  >DT.CMND.REG 

<DT. DATA. REG  ERRORl  1  <DT.DaTA.REG  ERROR2  I  ; 

:  .DT. ERROR  (  —  ) 

CR  ."  fatal  DT  board  error"  BELL  BELL 

CR  .”  DT. STATUS  register  valne  is  "  DT. STATUS  @  . 

GET-DT-ERR 

CR  ERRORx  register  values  are:  " 

CR  . "  byte  1  -  "  ERRORl  ^  . 

CR  ."  byte  2  -  "  ERROR!  @  .  ; 

DECIMAL 

\  :  ADIN  (  N  —  N  )  \  (  CHANNEL  —  12-BIT  VALUE  ) 

\  DTSTOP  DTCLEAR  SETUP-A/D  A/D-GAIN  SEL-CHANNEL 
\  DO-A/D 

\  ?DT. ERROR  I?  .DT. ERROR  THEN  ; 
variable  ch^ 

:  ADIN  (  N  —  N  )  \  (  CHANNEL  —  12-BIT  VALUE  ) 

chi?  !  0  10  0  do 

DTSTOP  DTCLEAR  SETUP-A/D  A/D-GAIN  ch#  SEL-CHANNEL 

DO-A/D  ?DT. ERROR  IF  .DT. ERROR  THEN  10  /  ^  loop  ; 

:  DAOUT  (  N  N  —  )  \  (  12-BIT  VALUE,  CHANNEL  —  ) 

DTSTOP  DTCLEAR  SETUP-D/A  SEL-CHANNEL 
DO-D/A  ?DT. ERROR  IF  .DT. ERROR  THEN  ; 

:  DOLT*  (  N  N  —  )  \  (  8 -BIT  VALUE ,  CHANNEL  —  ) 

DTSTOP  DTCLEAR  SETUP-DOUT  SEL-CHANNEL  DO-DOUT 
?DT.  ERROR  IF  ..DT.  ERROR  THEN  ; 

:  DA-OUT  (  N  N  —  )  \  (  16-BIT  VALUE,  CHANNEL  ~  ) 

DTSTOP  DTCLEAR  SETUP-DOUT  SEL  CHANNEL  DO-D/A 
?DT. ERROR  IF  .DT. ERROR  THEN  ; 
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HEX 


STUFF-IT  (  MOTOR-CHNL#,  SPEED  -  ) 


03FF  AND  \ 
SWAP  \ 
0400  *  \ 
+  DUP  \ 
2  DECIMAL  OA-OUT  \ 
HEX  -8000  OR  DUP  U. 
2  DA-OUT  ; 


mask  out  upper  6  bit 
get  motor-chnl# 
set  chnl  bit 

pack  channel  &  data  (16  bit) 
write  16  bit  to  I/O  port 
\  set  wr  »  1 


:  st-go  hex  -8000  2  da-out  ; 

:  TRY-IT  (  MOTOR-CHNL#,  SPEED  -  ) 

03FF  AND  \  mask  out  upper  C  bit 

SWAP  \  get  motor-chnl# 

0400  *  \  set  chnl  bit 

T  \  pack  channel  &  data  (16  bit) 

HEX  -8000  AND  U.  \  det  wr  =  1 


:  STUFF  (  MOTOR-CHNL#, 
03FF  AND  \ 

SWAP  \ 

0400  *  \ 

-  DUP  BINARY  16  .R 
2  DECIMAL  DA-OUT  \ 


SPEED - ) 

mask  out  upper  6  bit 
get  moror-chnl# 
sec  chnl  bit 

\  pack  channel  &  data  (16  bit) 
write  16  bit  to  I/O  port 


DECIMAL 

:  dt-clr  dtstcp  dtclear  setfor-dout  2  sel-channel  ; 

\  -  OMEGA  SUPPORT 

CR  . (  LOADING  OMEGA  SUPPORT  . .  ) 

80  constant  #omegabuf 
create  ‘omegabuf  #omegabuf  i-^  allot 
'omegabuf  1+  constant  omegabuf 

I 

:  clr. omegabuf  (  —  ) 

omegabuf  yomegabuf  32  fill 
0  'omegabuf  cl  ; 

defer  omegaport 
defer  clr . omegaport 
defer  ? omegaport 

;  read. omega  (  —  )  \  reads  into  omegabcf 

clr. omegabuf  begin  show-time  ’omegaport  until 
omegabuf  #omegabuf  omegaport  dos.read 
byte.cnt  @  'omegabuf  cl 
clr . omegaport  ; 

create  <cr>  13  c, 
create  <lf>  10  c, 
create  hnd. omega  0  , 

:  ?dos. write  (  buf ,  len,  handle  —  ) 

?dup  if  dos. write  then  ; 


\  'omegabuf  has  count 
\  omegabuf  is  beginning 
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:  echo . omega  (  —  ) 

^omegabuf  count 

2  -  swap  1-!-  swap  2dup  type  \  removing  leading  <lf>  and  ending  <cr> 

hnd. omega  @  ?dos. write 

<cr>  1  hnd. omega  §  ?dos. write 

<lf>  1  hnd. omega  @  ?dos. write  ;  \  append  <lf> 

:  TIMESTAi'lP  (  —  ) 

TIME$  8  hnd. omega  @  dos. write 
<CR>  1  hnd. omega  @  ?dos. write 
<LF>  1  hnd. omega  @  ?dos. write  ; 


\  OMEGA  COMMANDS 

create  wpvO  ascii  q  string  *0 : WCSP/0000 q 
create  rpvo  ascii  q  string  *0:RPV0:  q 
create  tmp.buff  50  allot 

:  str>omegaport  (  ‘adr  —  ) 

count  2dup  '  1-  13  swap  c!  omegaport  dos. write  ;  \  append  <cr> 

:  .RPVO  (  —  ) 

rpvO  str>omegaport  read. omega  ; 

:  .WPVO  (  —  ) 

wpvO  str>omegaport  read. omega  ; 

;  d/  (  d  u  -  a  ) 

SWAP  OVER  /MOD  >R  SWAP  UM/MOD 
SWAP  DROP  R>  ; 

:  omega-use-coml 

coml  filter. file. mode 
r ' ]  coml  is  omegaport 
T']  rsclea^  is  clr .omegaport 
r ' j  rsthere?  is  ? omegaport  ; 

:  cmega-use-com2 

open.com2  com2  filter. file. mode 
[ ' ]  com2  is  omegaport 
[']  com2clear  is  clr .omegaport 
[ ' ]  cora2there?  is  ?omeqaport  ; 

:  INIT-OMEGA  (  — •  ) 

omegaport  coml  =  if  0  else  1  then  1200  baud 
cr  ."  Omega  Port  is  initialized  to  1200  baud  " 
bell  clr . omegaport  bell  ; 

create  dspbuf  40  allot 

:  clr. dspbuf  (  —  )  dspbuf  40  32  fill  ; 
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;  >dspbuf  (  —  ) 

omegabuf  1+  dspbuf  ■'omegabuf  c@  2  -  amove  ; 

defer  (?dsport  defer  dsport 

:  ? dsport  (  —  tf  ) 
begin  (?dsport  until 
0  sp@  1  dsport  dos.read  11  = 
if  true  else  56  6  wat  ***”  false  then  ; 

or  .(  <devices,  ) 

:  ( stat>dsport  (  —  ) 

?dsport  if  d-comm  1  dsport  dos- write  then 
?dsport  if  dspbuf  38  dsport  dos -write  then  ; 


defer  stat>dsport  '  noop  is  stat>dsport 

<devices  —  ) 
clr . dspbuf 
get-DT 

stat>dsport  ; 


;  init-devices  (  —  ) 

cr  cr  checking  DT2805" 

check. dt. status  if  abort"  run  aborted  "  then 
cr  DT2805  OK" 

init-omega ; 

;  nodata>dsport  (  n  —  ) 
drop  ; 

:  create . omega . file  (  —  ) 

cr  create  omega  data  storage  file  ” 

[']  hnd. omega  is  file. handle 
create . a . file  ; 

:  close. omeaa, file 

hnd. omega  @  dos. close. file  ; 

:  open-files 

create . data .file  create . omega .file 
[']  (record-data  is  record-data  ; 

:  close-files 

close. data. file  close. omega. file  ; 

:  do-cleanup 

reset-controllers 
close-files  ; 


74-23 


\  Omega  2010  Temperature  Controller  words 

\ 

create  temp  5  allot 

5  temp  ci 

:  display-temp  \  display  the  temp  received  in  number 

5  0  do  temp  i  +  c@  emit  loop  ; 

;  show-temp  \  show  the  temp  received  in  ASCII  no. 

5  0  do  temp  i  +  c@  .  loop  ; 

:  use-devices  (  —  ) 

[']  (wait-for-clock  is  wait-f or-clock 

[']  open-files  is  prep-data-files 

[ ' ]  do-cleanup  is  tidy-up 

[']  ( init-setpoints  is  init-setpoints 

r ' ]  <devices  is  <monitors 

[']  (do-heat-up  is  do-heat-up 

r'l  (do-heat-hold  is  do-heat-hold 

[']  (do-heat-down  is  do-heat-down 

:  DE^^ICES  (  “  ) 

USE-DEVICES 

cr  . ”  enter  RUN  to  start  "  ; 

:  extract-temp  \  extract  the  temp  received  &  store  under 

‘omegabuf  swap  f  2  temp  1  4  cmove>  ; 

;  findcolon  \  search  for  separator  in  the  input  strinc 

'omegabuf  swap  c@  58  =  ; 

;  get-temp  \ 

‘oroegabuf  count  0  do  i  findcolon 
if  i  extract-temp  leave,  then  loop  drop  ; 

:  string  (  c  —  )  word  c@  1+  allot  ; 


:  set-temp 

temp  1  *  wpvO  9-^4  croove>  ; 

:  display-setpt 
wpvO  count  type  ; 

:  r-display 

.rpvO  get-temp  set-temp  display-setpt  ; 

:  send-auto  (  —  ) 

.rpvo  get-temp  set-temp 

wpvO  str>oraegaport  read. omega  ; 

:  send-manual  (  —  ) 

cr  . ”  Enter  set  point  xxxx  :  ” 

wpvO  9+4  expect 

wpvO  str>omegaport  read. omega  ; 

omega-use-coml 

devices 
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ABSTRACT 


Titanium  aluminides  with  strong  thermodynamically  stable 
intermetallic  phases  are  significantly  potential  advanced 
materials  for  high  temperature  applications.  The  gamma  based 
titanium  aluminides  of  the  PREP-HIP  Ti-48Al-lV,  isothermal 
compression  tested  for  several  strain  rate  conditions 
and  temperatures  from  1000  to  1350  C  were  investigated  for  phase 
behavior  and  transformations  to  include  the  transformation  at 
1285  C  in  the  TiAl  system.  Material  characterizations  were 
conducted  using  optical  metallography,  DIG  methods  and  scanning 
electron  microscopy.  Working  with  the  Air  Force  and  the 
Processing  Science  Group  of  the  Materials  Laboratory,  WPAFB,  OH, 
and  using  MIS  generated  processing  maps,  the  thermomechanical 
processing  characteristics  and  the  structural  evolutions  were 
analyzed  and  evaluated.  Evaluations  of  influence  of  micro- 
structural  variations  were  carried  out  for  the  CAST-HIP  alloy 
system  supplied  and  designated  as  alloy  B  .  The  research 
focused  on  characterization  and  evaluations  of  microstructural 
mechanisms  for  stable  material  flow  conditions  with  maximum 
efficiency  of  power  dissipation  through  metallurgical  processes 
in  these  titanium  aluminide  alloy  systems.  Detailed  examination 
of  the  microstructural  and  mechanical  behavior  was  carried  out 
as  a  part  of  understanding  the  constitutive  behavior  and  unique 
deformation  mechanisms  in  the  V  titanium  aluminide  alloy  systems. 
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INFLUENCE  OF  MICROSTRUCTURAL  VARIATIONS  ON  THE  THERMOMECHANICAL 
PROCESSING  IN  DYNAMIC  MATERIAL  MODELING  OF  TITANIUM  ALUMINIDES 

1.0  INTRODUCTION 

The  influence  of  microstructural  variations  on  the  thermo¬ 
mechanical  processing  in  dynamic  material  modeling  of  titanium 
aluminides  has  been  investigated  for  the  following  materials 
supplied  by  the  Air  Force,  UES.,  Inc  in  case  of  two  selected 
titanium  aluminides  systems: 

1.  the  PREP-HIP  Ti-48  Al-lV  material;  and 

2.  the  CAST-HIP  TiAl  material,  designated  as  alloy  B. 

The  effects  of  the  constituent  phases  and  the  processing 

conditions,  including  several  temperature  and  strain  rate  kine¬ 
matic  variable  conditions  have  been  investigated  to  understand 
and  analyze  the  microstructural  mechanisms,  and  material 
stability  in  relation  to  the  dynamic  material  modeling  DMM  maps, 
using  the  Material  Information  System  MIS  system  generated 
stability  maps.  The  DMM  stability  maps  can  be  converted  to 
devise  conditions  of  stable  constitutive  material  flow  to  shorten 
the  cycle  time  of  the  alloy  development  and  for  selection  of 
the  more  efficient  processing  conditions  related  to  micro- 
structural  mechanisms  and  supported  by  metallurgical  constituent 
phase  conditions. 

This  research  work  and  study  of  microstructural  variations 
related  to  the  dynamic  material  modeling  of  titanium  aluminides 
was  carried  on  as  follow-on  of  the  summer  faculty  research 
program  in  which  the  principal  investigator  participated  in 
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the  Processing  Science  Group  of  the  Materials  Laboratory,  WPAFB, 

MLLM,  Ohio.  The  summer  faculty  research  work  has  been  on  some 
of  the  microstructural  developments  in  the  specific  Tx-48Al-lV 
h?  -aluminide  under  constant  rate,  isothermal,  hot  deformation 
compression  conditions.  However,  the  test  temperature  conditions 
had  analyzed  the  material  behavior  only  upto  1250  C.  The  results 
of  that  study  had  indicated  the  need  to  characterize  the  behavior 
of  these  "9  -aluminides  to  include  higher  temperature  test  conditions, 
and  to  analyze  in  particular  the  microstructural  developments  in 
this  Ti-Al  system  at  and  above  the  peritectoid  transformation 
seen  at  a  1285  C  in  this  system. 

In  this  on-going  research,  the  main  goal  has  been  to  better 
understand  the  power  partitioning  processes  and  carryout  micro- 
structural  characterizations  in  the  proposed  titanium  aluminide 
alloy  systems.  To  that  end,  complete  characterization  of  the 
PREP  HIP  Ti-48Al-lV  model  material  has  been  carried  out  to  higher 
temperatures,  including  to  1350  C  at  constant  strain  rates  from 
10°  -  10  ^  S  Another  Ti-Al  base  composition  material,  and 

with  TiB2  additions  were  not  available  and  with  no  Air  Force 
test  plan  could  not  be  analyzed;  however,  investigations  for 
microstructural  variations  and  their  effects  on  dynamic  material 
modeling  behavior  in  the  CAST-HIP  TiAl  material  alloy  B  are 
carried  out  of  their  stability  and  microstructural  mechanisms. 

The  alloy  B  processing  test  conditions  are  in  test  regime  for 
temperatures  from  1000  to  1175  C  and  at  strain  rates  from 
10~^'^  to  10  S”^. 
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2.0  APPROACH  TO  THERMOMECHANICAL  MODELING 

AND  STUDY  OF  MICROSTRUCTURAL  DYNAMICS 

The  thermomechanical  processing  of  materials  can  be  analyzed 
using  a  knowledge  of  interrelation  between  their  processing  para¬ 
meters  and  microstructural  dynamics.  The  theoretical  basis  for 
these  analyses  is  based  on  the  approach  ^  ^  of  dynamic  material 
modeling  DMM  which  is  a  significant  tool  to  arrive  at  the  "safe 
processing  windows"  and  for  designing  material  deformation 
processes.  This  approach  has  been  utilized  in  the  present  re¬ 
search  work  and  as  seen  in  some  of  the  earlier  studies  ^  for 
certain  aluminum  based  2024-Si C  alloys,  nickel  based  alloys, 
Ti6Al4V,  Ti-based  6242  alloys,  and  titanium  aluminide  material 
systems.  The  development  of  microstructures  and  constitutive 
behavior  of  a  work  piece  material  depends  on  the  effects  of  stress, 
temperature,  strain  levels  and  strain  rates  during  processing. 

The  development  of  dynamic  material  modeling  DMM  approach 
is  based  on  understanding  the  several  irreversible  thermodynamic 
processes  which  occur  in  the  work  piece  material  and  are  controlled 
by  the  rate  of  energy  input  and  the  subsequent  dissipation  of 
energy  by  dynamic  metallurgical  processes.  In  dual  or  multiphase 
systems,  several  atomic  processes  are  possible  which  combine  to 
create  new  structures  and  provide  the  degrees  of  freedom  needed 

for  stable  plastic  flows.  Stability  is  determined  by  the  con- 

-.2  • 

dition  that  the  excess  entropy  production  rate,  1/2  5  S  ,  is 

greater  than  zero.  Simple  compression  testing  can  be  used  for 
the  dynamic  material  modeling  if  a  "steady"  state  can  be 
approached  asymptotically  with  time.  This  DMM  approach  is  quite 
useful  and  differs  from  the  mechanistic  approach,  for  in  the  DMM 
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approach  the  mechanisms  active  in  a  particular  region  are  determined 
after  the  processing  maps  have  been  constructed.  This  thermo- 
mechanical  DMM  approach  thus  allows  a  material  systems  approach, 
in  which  to  view  a  material  as  an  energy  dispersal  systexm  and  to 
analyze  its  behavior  macroscopically ,  without  making  a  priori 
determinations  of  the  operating  mechanisms. 

This  research  project  involved  the  study  of  effects  of  high 
temperature  flow  processes  in  gamma  based  titanium  aluminides  as 
affected  by  V  -grains,  the  ^2  phases,  any  transformed  ^  phase 
at  higher  temperatures,  interlamellar  spacings  variations  in 
microstructure,  and  other  microstructural  effects  in  the  specified 
titanium  aluminide  material  alloy  systems.  The  conditions  in  this 
referenced  research  are  analyzed  by  carrying  out  the  following: 

(a)  Study  of  the  thermomechanical  modeling  DMM  maps  at  certain 
strain  levels  of  the  PREP-HIP  Ti-48Al-lV  and  of  the  alloy  B 
CAST-HIP  Ti-Al  material 

(b)  Analyses  of  the  operative  microstructural  mechanisms  in 
relation  to  the  stability  maps  in  the  2 /'^'^i^^°c°^stituents 

(c)  Material  microstructural  characterizations  of  the  specific 
PREP-HIP  Ti-48Al-lV  and  CAST-HIP  Ti-Al  material 

(d)  Analyze  and  catalog  any  observable  differences  in  the  two 
specific  test  materials  and  study  of  selected  test  material 
specimens 

(e)  Analysis  of  limiting  critical  strain  level  concept  and 
evaluation  and  formalism  of  any  microstructural  mechanisms 
related  to  material  flows  in  the  specific  test  material  alloy 
systems . 
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3.0  ANALYSIS  OF  THE  PREP-HIP  Ti-48Al-lV  MATERIAL 

This  research  work  has  been  carried  out  in  collaboration 
with  Processing  Science  WPAFB,  MLLM,  Material  Laboratory  and 
conducted  at  Southern  University,  Baton  Rouge,  LA.  After  formula¬ 
tion  of  the  specific  research  problem,  and  the  microstructural 
analyses  approaches,  the  PREP-HIP  Ti-48Al-lV  material  system 
tested  up  to  1350  C  for  strain  rates  from  10°  to  10~^  S~^  was 
selected  for  analyses  of  microstructural  variations  and  dynamic 
material  modeling  behavior. 

Test  data  analyzed  in  the  1000-1250C  of  this  PREP-HIP  Ti- 
48A1-1V  has  been  studied  for  its  flow  curves  for  different  strain 
rate  conditions.  This  part  of  the  study  has  revealed  flow  curves 
approaching  steady  state  or  some  flow  softening.  The  typical 
slight  barelling  in  some  specimens  was  seen  on  microscopic  exam¬ 
ination  to  indicate  microstructurally  observed  cracking  at  end 
regions.  These  studies  were  further  carried  out  to  study  the 
stress-strain  behavior  of  the  PREP-HIP  Ti-48Al-lV  material  at 
1250  C  plus  1275-1350  C.  This  testing  was  carried  out  by  the 
Wright  State  University,  and  test  data  indicated  continuous 
gradual  flow  softening. 
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3.1  ANALYSIS  OF  PROCESSING  MAPS 

The  stress  strain  data  were  analyzed  using  the  dynamic 
material  modeling  computer  program  MIS  to  produce  the  processing 
maps  which  are  then  used  to  provide  the  framework  for  micro- 
structural  analyses  to  control  the  material  flow  characteristics. 

The  stable  processing  regions  marked  with  X  are  noted,  and  are  to 
be  analyzed  in  detail  at  temperatures  of  1250,  and  1275-1350  C. 

This  PREP-HIP  Ti-48Al-lV  material  has  been  earlier  analyzed 
and  had  indicated  cracking  at  10°  S~^  strain  rate,  and  therefore 
the  higher  temperature  compression  testing  to  1350  C  was  carried 
out  from  10  to  10  ^  S  ^  strain  rate  conditions.  The  as-received 
starting  material  PREP-HIP  Ti-48Al-lV  had  been  characterized  in 
detail  for  identification  of  composition  and  starting  microstructure. 

The  overall  processing  maps  with  plot  of  efficiency  with 

stability  maps  at  0.4  strain  level  indicated  intability  near 

1275  C  which  could  be  due  to  the  peritectoid  transformation 

temperature  of  c:  1285  C  as  seen  in  the  Ti-Al  phase  diagram. 

The  study  of  the  titanium  aluminides  system,  it  should  be  noted, 

is  made  difficult  due  to  the  limitation  of  unconfirmed  information 

11,12 

of  phase  boundaries  and  phase  stabilities,  of  the  titanium- 

aluminum  system.  The  processing  maps  are  analyzed  and  thermo¬ 
mechanical  modeling  carried  out  in  our  specified  '9  based  TiAl 
alloy  systems. 
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3.2  MICROSTRUCTURAL  DYNAMICS  AND  THERMOMECHANICAL  MODELING 

IN  PREP-HIP  Ti-48Al-lV 

The  main  objectives  of  this  reseiirch  are  to  identify  and 
characterize  the  microstructural  aspects  in  selected  titanium 
aluminide  materials  related  to  their  DMM  stability  processing  maps. 
This  research  has  been  focussed  on  the  study  of  the  "as-received" 
and  "processing"  microstructures  to  understand  the  microstructural 
evolutions  during  stable  material  flow.  This  involved  problem 
formulation  for  the  TiAl  system,  relevant  literature  review  and 
formulation  of  a  program  to  determine  the  microstructural  con¬ 
stituents  and  their  behavior  in  titanium  aluminides. 

As  formulated  in  the  proposal,  the  PREP-HIP  Ti-48Al-lV 
material  system  tested  up  to  1350  C  for  strain  rates  from  10°  to 
lO”^  S  ^  had  to  be  analyzed  for  microstructural  variations  and 
dynamic  material  modeling  behavior.  Specimens  TAV64WSU,  TAV65WSU, 
TAV63WSU,  TAV73WSU,  TAV79WSU,  TAV76WSU,  TAV78WSU  and  TAV77WSU  for 
different  temperature  and  strain  rates  supplied  by  UES,  Inc. 
were  initially  evaluated  for  their  microstructural  characterizations. 
In  later  discussions  with  Processing  Science  Group,  MLLM,  and  during 
visit  to  the  WPAFB  Materials  Laboratory,  eight  additional  specimens 
were  supplied.  These  specimens  TAV71WSU,  TAV67WSU,  TAV69WSU, 
TAV66WSU,  TAV68WSU,  TAV72WSU,  TAV74WSU,  and  TAV75WSU,  were  also 
analyzed  for  specific  evaluation  of  their  microstructural  evolu¬ 
tions  related  to  the  DMM  maps. 
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A  program  to  determine  the  microstructural  constituents  and 
their  behavior  in  titanium  aluminides  was  set  up.  The  following 
determinations  were  made; 

Optical  metallography,  microstructural  analysis 
using  SEM  techniques,  microhardness  measurements. 

For  metallographic  specimens  preparation,  Kroll's  Reagent  with 
composition  3ml  HF,  6  ml  HNO^ ,  100  ml  H2O,  was  found  to  be  suitable 
for  optical  microstructural  analysis.  Test  conditions  of  the  material 
specimens  are  given  in  Table  I  of  strain  rates  and  temperatures 
during  their  controlled  compression  testing. 

The  "as-received"  PREP-HIP  material  on  metallographic  exam¬ 
ination  as  the  starting  structure  indicated  a  large  variability 
in  the  grain  size  of  the  V  -grains  which  could  be  due  to  the 
variability  and  distribution  in  this  preconsolidated  PREP-alloy 
powder  size.  The  microstructure  also  revealed  regions  of  possible 
interlamellar  structure,  in  the  dualV  +  o<2  phase  regions,  fine 
second  phase  oC  2  particles,  and  twins  in  the  matrix  V -phase 
as  seen  in  figures  la  and  lb.  In  this  PREP-HIP  alloy  powder 
using  the  transmission  electron  microscopy ,^^a  number  of  widely 
separated  V  grains  were  also  observed  which  contained  thin  0^  2 
lamella.  Some  V  grains  had  also  consisted  of  a  high  density 
of  dislocations,  twins,  and  stacking  facults.  It  should  be  noted 
that  since  most  of  the  contiguous  V  grains  did  not  contain  the 
ot 2  -  phase,  it  is  believed  that  the  high  temperature  flow 
processes  will  be  essentially  governed  by  a  large  volume  of  V 
grains . 
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Table  1:  PREP  HIP  TI48A11V  MATERIAL  TEST  CONDITIONS 


(OC)  1250 

1275 

1300 

1325 

1350 

TAV70WSU 

TAV71WSU 

TAV67WSU 

TAV72WSU 

TAV79WSU 

10”^ 

TAV60WSU 

TAV64WSU 

TAV69WSU 

TAV73WSU 

TAV76WSU 

10"^ 

TAV62WSU 

TAV65WSU 

TAV66WSU 

TAV74WSU 

TAV78WSU 

10~^ 

TAV61WSU 

TAV63WSU 

TAV68WSU 

TAV75WSU 

TAV77WSU 
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The  analysis  of  the  stability  maps  and  plots  of  efficiency 
with  stability  maps,  figure  2,  on  dynamic  material  modeling  at 
0.4  strain  level  are  used  to  investigate  the  stable  and  unstable 
regions  of  the  material  flow  conditions.  It  should  be  noted  in 
these  that  there  appears  to  be  the  presence  of  instability  at  about 
1275  C.  Close  to  this  and  at  these  higher  temperatures  near  1275  C 
as  related  to  the  TiAl  phase  diagram,  figure  3,  in  which,  the 
TiAl  system  shows  a  phase  transformation  at  ^  1285  C.  Close  to 
this  temperature,  and  up  to  1350  C,  the  system  has  been  investigated 
by  microstructural  analyses  to  identify  the  microstructural  mechanisms 
during  thermomechanical  processing  of  the  PREP-HIP  Ti-48Al-lV. 

Several  specimens,  as  shown  in  figure  2  have  been  investigated. 

Overall  microanalysis  of  a  typical  compression  tested  specimen 
confirmed  on  microprobe  analysis  the  basic  TiAl  phase  composition, 
the  overall  scan  and  microanalysis  report  results  are  included  in 
figure  4a  and  b.  In  some  isolated  locations,  regions  with  in¬ 
clusions  are  seen.  Also  in  certain  strain  rate  temperature  test 
conditions,  using  the  DIG  microscopic  methods,  some  bright  white 
phase  particles  are  seen.  These  microscopic  studies  indicate  the 
possibility  of  0^2,  ,  and  of  some  transformed  P  phase  particles. 

Analyses  of  these  might  require  some  transmission  thin  foil  elec¬ 
tron  microscopy  work  and  are  recommended. 
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Figure  la.  As  received  PREP-HIP  Ti-48-Al-lV  material  showing 
regions  of  possible  interlamellar  structure,  fine 
second  phase  particles,  and  twins  in  the  matrix 
V  phase.  X400 


Figure  lb. 


SEM  micrograph  in  the  compositional  mode  of  the 
as  received  PREP-HIP  Ti-48-Al-lV  material  showing 
effects  of  stress  and  possible  interlamellar 
arrangement.  XIOOO 
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Figure  2.  Plot  of  efficiency  with  stability  map  of  PREP-HIP 


Ti-48Al-lV  at  0.4  strain. 


Temperature 


Weight  Percent  Aluminum 


Figure  3.  Phase  diagram  of  the  Ti-Al  system  showing  Cq ,  the 
base  alloy  composition  with  48%  Al. 
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Figure  4a;  Microanalysis  report  of  the  TAV77WSU  specimen. 


75-16 


ir.  u  j 


s 

I 

I 


s 


610  KCV 


Figure  4b.  Overall  scan  showing  peaks  on  microprobe  analysis 
of  Ti  and  A1 ,  with  corresponding  microanalysis 
report  in  Figure  4a.  Specimen  TAV77WSU. 
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Optical  microscopy  and  SEM  studies  performed  on  the  selected 

test  specimens,  at  the  higher  temperature  of  1350C  at  strain  rates 

from  10  to  10^  S  have  in  different  samples  shown  oxide  scale 

on  the  flat  ends  and  also  presence  of  cracks  as  seen  in  figure  5a. 
c  -1  -1 

This  is  for  ^  =  10  s  at  1350  C.  Some  cracks  are  seen  on  both 
sides  and  some  inclusions  are  also  seen.  At  the  lower  strain  rate 
of  t  =  10  S  ,  it  appears  that  the  possible  formation  of  the 
^  -2  transformed  into  coarse  ^  -phase  can  be  seen  appearing  in  a 
lacy  configuration.  These  -phase  particles  seem  to  envelope  the 
"^-grains  at  most  locations.  Some  twinning  is  still  seen  in  the 
interior  of  the  V  grains  as  seen  in  figure  5b  and  5c.  Similar 
effects  are  also  present  in  typical  figure  5d  and  5e  at  lower  strain 
rate  of  H  =  10  s  .  Interlameller  features  showing  effects  of 
deformation  and  high  temperature  are  also  seen.  Partial  recry¬ 
stallization  and  dynamic  strain  effects  can  also  be  seen  in  these 
micrographs,  as  seen  in  figures  5f  for  1325  C,  10”^  s”^  strain 

rate,  and  in  figure  5g  and  5h  for  1325  C,  10°  S~^  strain  rate 
-2  -1 

and  10  S  strain  rate. 

At  the  specimens  close  to  -Cr  1285  C  of  the  peritectoid  tran¬ 
sformation  temperature,  which  for  specimens  at  1275  C,  several 
strain  rates  form  10°  to  10  ^  S  ^  were  investigated,  figures  6a, 
b,  c,  d,  and  e.  Specimens  tested  at  1300  C,  figures  6f  through  j, 
indicate  very  little  film  and  cracks  on  flat  end  areas  and  show 
twinning  and  presence  of  some  P  phase  is  also  seen.  Nucleation 
of  the  P  phase  in  seen  at  lower  strain  rates  1275  C,  10~^  and 
10  ^  S  ^ .  In  some  regions,  presence  of  some  blocky  grains  is 
also  observed.  Also  seen  is  partial  recrystallization  at  about 
1300  C,  and  dynamic  nucleation  and  growth  of  the  P  phase.  Overall 
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microhardness  values  are  about  240-258  VHN,  whereas  near  and  inside 

blocky  grains,  the  hardness  values  appear  higher,  the  measurement 

of  microhardness  of  the  transformed  grains  has  been  difficult 

due  to  their  small  size  particles,  figures  6i  and  6j.  It  is  also 

-2  -1 

seen  in  certain  1300  C,  10  S  specimens  that  the  center  regions 
indicate  fine  grain  structure  with  very  little  twinning,  whereas 
the  end  portions  show  twinning  and  also  dynamic  recrystallization 
effects  and  possible  transformed  fi  grains  formation,  figures  6k 
and  61. 
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Figure  6a.  1275  C,  10°  S"^.  x200 
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4.0  ANALYSIS  OF  THE  ALLOY  B  CAST-HIP  Ti-Al  MATERIAL 


The  CAST-HIP  Ti-Al  material  was  examined  for  microstructural 
evolution  of  specimens  compression  tested  at  temperatures  from 
1000-1175  C  at  strain  rates  from  10  to  10  S~^.  The 

several  specimens  analyzed  for  this  study  were  labelled  as  per 
the  following  conditions.  Table  2.  The  nominal  composition 
of  this  alloy  is  similar  to  Ti  48A1  base  alloy,  but  the  chemistry 
specifications  and  the  casting  and  HIP  procession  specifications 
are  proprietary  and  are  thus  not  included.  The  microstructural 
evaluations  were  carried  out  in  a  similar  manner  as  for  the 
earlier  PREP-HIP  Ti-48Al-lV  alloy. 


Table  2  Conditions  of  Specimens  Compression  Tested  for  Alloy  B 
CAST  HIP  Material 


Specimen  Tested 

Temp.  (  c)  Strain  Rate  (S  )  Identification 


1000 

C 

-1  -1 

10  S 

B-17 

1000 

C 

lO"^  S~^ 

B-8 

1050 

C 

10~^  S~^ 

B-19 

1125 

C 

lo"^  s~^ 

B-3 

1150 

C 

lo"*^  s"^ 

B-31 

1150 

C 

s-1 

B-39 

1175 

C 

10~^'^  s”^ 

B-48 

1175 

C 

10-2-^  s-1 

B-40 
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4.1  ANALYSIS  OF  PROCESSING  MAPS 

The  test  data  obtained  on  isothermal  compression  testing 
of  the  alloy  B  CAST-HIP  TiAl  material  was  used  to  compute  the 
processing  stability  maps  of  this  material.  In  these  processing 
maps,  for  data  from  1000  to  1175  C,  stable  regions  are  seen  about 
1000-1050  C,  between  1150-1175  C,  and  also  unstable  regions  are 
noted  close  to  -C:  1125  C.  This  could  be  due  to  the  possible  phase 
transformation  seen  at  ■:^1125  C  in  the  Ti-Al  phase  diagram. 

In  this  analysis,  and  based  on  samples  analyzed,  upto 
4-  1125  C  temperature  conditions  are  considered  low  temperature 
conditions  and  above  1125  to  1175  C  are  considered  high  temperature 
conditions.  Combinations  of  these  low  temperature,  high  temperature 
conditions  with  several  strain  rate  conditions  are  used  to  analyze 
the  microstructural  evolutions  as  seen  and  investigated  in  these 
alloy  B  cast  material. 
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4.2  MICROSTRUCTURAL  DYNAMICS  AND  THERMOMECHANICAL  MODELING 

STUDIES  IN  CAST-HIP  TiAl  MATERIAL 

The  specimens  as  shown  in  Table  2  earlier  were  analyzed. 

The  "as-received"  CAST-HIP  material  revealed  some  twinning, 
variability  in  grain  size  of  the  matrix  material,  and  possible 
interlamellar  structure,  figure  7a  and  7b.  The  material  exhibits 
variability  in  this  cast  product,  and  the  interlamellar  regions 
or  any  dual  phases  need  to  be  identified  using  this  foil  trans¬ 
mission  electron  microscopy. 

For  specimens  on  compression  testing  at  low  temperature 
^  1125  C  and  high  strain  rates  conditions  (f  ^  10  S  ), 

the  structures  are  seen  to  exhibit  in  general  the  presence  of 
microcracks  at  one  edge,  usually  isolated  to  one  end  corner. 
Detailed  optical  and  scanning  electron  microscopy  analysis 
revealed  banded  structure,  possibility  of  interlamellar  2 
phase  lath  type  structure,  0^  2  phase  particles, presence  of  porosity 
and  effects  of  processing  and  deformation  conditions  as  well  as 
wide  variability  in  microstructures  in  the  interior  compared  to 
the  end  regions.  Typical  micrographs  are  shown  in  figures  8  and 
9.  It  is  seen  that  as  the  material  deforms,  power  dissipation 
is  mainly  through  stored  energy  as  atomic  defects,  and  in  these 
specimens  end  corners  have  shown  the  presence  of  microcracks. 
Microstructures  in  these  low  temperature,  high  strain  rate 
conditions  would  have  significant  dislocations  interactions  and 
hardening  of  the  matrix  phase.  Study  of  behavior  of  the  defect 
structures  and  recovery  processes  in  these  materials  requires 
transmission  electron  microscopy  work  to  confirm  the  mechanisms, 
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which  could  not  be  carried  due  to  non  availability  of  instrumen¬ 
tation  and  the  techniques.  Any  subgrains  formation  or  dynamic 
recrystallization  effects  would  become  possible  depending  on 
temperature  and  diffusional  mechanisms,  possibly  with  increasing 

temperature  and  at  lower  strain  rates. 

-  5  -2  5-1 

At  strain  rates  from  10  '  to  10  ’  S  ,  and  at  the  higher 

temperature  conditions  (T'^1150  C)  some  typical  microstructures 
of  the  CAST-HIP  TiAl  material  are  shown  in  figures  10  and  11. 

For  1150  C  compression  tested  specimens,  a  lot  of  variation  is 
seen  for  typical  microstructures.  Effects  of  banding  and  deforma¬ 
tion  in  the  2  +  ^  phase  structures  are  still  seen  with  a  near 
end  crack.  This  does  not  seem  to  agree  with  the  location  of 
stability  regions  at  and  close  to  1150  C.  At  and  above  ^1125  C 
transformation  temperature,  under  dynamic  conditions  and  aided 
by  lower  strain  rates  there  is  possibility  of  the  ordered  2 
phase  to  transform  to  the  phase.  These  and  other  energy  dissi¬ 
pative  processes  can  involve  diffusional  flows,  transformations 
and  recovery  processes.  It  should  be  noted  that  this  material 
exhibits  much  more  variability  in  structure  compared  to  the  PREP- 
HIP  Ti-48Al-lV  and  in  typical  structures  has  revealed  mixed  and 
somewhat  polyhedral  grains,  with  inter lamellar  structure  towards 
end  portions  of  the  specimens,  whereas  in  the  interior,  structures 
involved  appear  to  be  related  to  dynamic  recovery  and  polygoni- 
zation  processes.  Effects  of  banding  and  deformation  are  also 
seen  in  some  of  the  specimens. 
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Figure  7a.  As  received  CAST-HiP  alloy  B  TiAl  material  showing 

some  twinning,  variability  in  the  V  grain  size  of  the 
matrix.  XlOO 


Figure  7b.  As  received  CAST-HIP  alloy  B  TiAl  material  side 
microstructural  view  with  possible  interlamellar 
structure,  some  twinning,  and  variability  in  grain 
size.  XlOO 
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(a)  XlOO 


(b)  XlOO 

Figure  3.  SEM  Micrographs  of  CAST-HIP  TiAl  material  for 
low  temperatures,  high  strain  i^ate  compression 
tested  conditions  (a)  1000  C,  10“  s"^ 

(b)  1125  C,  10“^  3"^  conditions. 
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(c)  X500 


(d)  XlOO 

Figure  8.  SEM  micrographs  of  CAST-HIP  TiAl  material  for  low 
temperature,  high  strain  rate  compression  test 
conditions  (a)  showing  porosity  and  corner  oriented 
crack  (b)  1050  C,  10”^  >'c)  1000  C,  10"^  3~^. 
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(c)  XlOO 


X200 


Figure  9.  Optical  micrographs  of  CAST-HIP  TiAl  material 

for  low  temperatures,  high  strain  rate  compression 
test  conditions  showing  banding,  inter lamel lar 
structure  and  possible  dynamic  processing  effects 
(c)  1050  C,  10"^  S"^  (d)  1125  C,  10“^  S'^. 


X400 

Figure  9.  Optical  micrograph  of  CAST-HIP  TiAl  material  for 
low  temperature,  high  strain  rate  compression 
tested  specimen  (e)  1125  C,  lO”^  S~^. 


(a)  XIOOO 


(b)  X50 

Figure  10.  SEM  micrographs  of  CAST-HIP  TiAl  material  for 
high  temperature,  low  strain  rate  compression 
tested  specimen  at  1150  C,  10“^-^ 
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(c)  X200 


Figure  10.  SEM  micrograph  of  CAST-HIP  TiAl  material  for 
high  temperature,  low  strain  compression 
tested  specimen  at  1175  C,  10"^*^  S~^. 
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Figure  11. 
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(b)  X200 

Optical  micrographs  of  CAST-HIP  TiAl  material 
for  high  temperature,  high  strain^rate  compression 


tested  specimens  (a)  1150  C,  10 


1175  C, 


(d)  X200 

Figure  11.  Optical  micrographs  of  CAST-HIP  TiAl  material 

for  high  temperature,  low  strain  rate  compression 
tested  specimen.  Variability  in  the  specimen 
structure  seen  for  interior  region  in  (c2,and 
towards  edge  in  (d)  for  1175  C,  lO"^'^  S 


75 


X200 


Figure  11.  Optical  micrographs  of  CAST-HIP  TiAl  material 
for  high  temperature,  low  strain  rate  compres 
tested  specimen  (e)  1150  C,  10”^'^  S~^. 
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5.0  DISCUSSION 


The  mechanistic  modeling  approach  to  understanding  work¬ 
ability  is  based  on  fracture  and  material  behavior  based  on 

atomistic  models.  Such  studies  of  atomistic  fracture  mechanisms 

14  15 

as  done  by  Ashby  and  Frost  and  Raj  provide  some  basic  under¬ 
standing  of  microstructural  operating  mechanisms,  but  are  not 
completely  suitable  in  case  of  complex  engineering  alloys 
such  as  titanium  aluminides.  It  should  be  further  understood 
that  the  workability  of  a  material  would  be  a  function  both 
of  the  material  as  well  as  its  thermomechanical  processing 
methods . 

The  material  characteristics  would  be  dependent  upon  phase 
and  compositional  effects,  grain  size,  grain  distribution  the 
as-received  structure  and  material  dynamic  effects.  The  thermo¬ 
mechanical  process  characteristics  would  primarily  include  the 
kinematic  variables  such  as  the  strain  rate  and  temperature 
conditions,  critical  strain  levels  and  other  process  considera¬ 
tions  . 

A  direct  approach  of  understanding  the  microstructural 
mechanisms  is  based  upon  studying  the  dynamic  material  modeling 
DMM  maps  and  evaluation  of  stability  regions  and  controlling 
microstructures.  This  approach  was  utilized  in  this  research 
by  direct  experimental  structural  characterizations,  study  of 
DMM  maps  and  microstructural  analyses  of  the  two  types  of 
titanium  aluminide  alloy  system''  as  specified. 
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The  microstructural  dynamics  studies  have  clearly  shown 
the  differences  in  the  microstructural  evolutions,  and  effects 
of  different  processing  histories  in  case  of  the  two  specified 
titanium  aluminide  alloy  systems.  In  case  of  PREP-HIP  Ti-48Al-lV, 
power  partitioning  has  been  shown  to  be  due  to  the  dynamic  recovery 
and  polygonization  processes  related  to  subgrain  formation.  This 
system  has  not  shown  any  grain  boundary  sliding,  wedge  cracking 
or  superplasticity  behavior  mechanisms  in  these  intermeltallic 
alloys.  The  system  has  indicated  overall  metallurgical  stability, 
though  mechanical  instabilities  are  seen  as  porosity,  and  crack 
propagation  in  some  specimens. 

The  dynamic  material  modeling  approach  has  pointed  out  the 
conditions  of  stability  in  processing  flow  and  has  clearly 
shown  a  region  of  mechanical  stability  C:  1150  C.  Furthermore, 
for  the  PREP-HIP  Ti-48Al-lV  material,  transitions  are  seen  in 
the  stability,  instability  regions  close  to  1^1125  C  and 
^  1285  C,  the  regions  close  to  transformation  temperatures 
seen  in  the  Ti-Al  phase  diagram.  The  effects  of  the  higher 
temperatures  phase  in  the  study  of  the  PREP-HIP  Ti-48Al-lV  are 
seen  in  the  presence  of  any  ^  2>  ^  transformed  |3  • 

Close  t'  1350  C,  the  system  has  shown  the  dynamic  nucleation 
and  growth  of  the  P  phase  This  dynamic  nucleation,  dependent 
on  the  temperature  and  strain  rate  conditions,  could  have  set 
in  at  -22:1275  C,  with  possibility  of  coarse  3  formation  for 
1350  C  specimen  conditions.  Also,  this  alloy  system  has  shown 
a  transition  in  material  flow  conditions,  such  as  in  going 
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from  stability  regions  in  processing-stability  maps  at  the  0.3 
strain  level  to  the  0.5  strain  level  which  indicates  that  the 
deformation  processing  should  be  stopped  at  a  certain  "critical" 
strain  level.  More  microstructural  characterization  and  analyses 
are  needed  in  this  direction  to  fully  evaluate  this  concept. 

The  effects  of  material  flow  conditions  are  largely  depen¬ 
dent  on  the  characteristics  of  the  predominant  matrix  V  phase, 
with  only  small  amounts  of  the  ^ny  transformed  ^ 

phase.  The  microstructures  have  revealed  twinning,  microcracks, 
and  dynamic  recovery,  polygonization,  and  partial  dynamic  re¬ 
crystallization  processes.  Interlamellar  features  showing 
effects  of  deformation  and  high  temperature  are  also  seen.  The 
microstructures  also  reveal  oxide  scale  on  flat  ends  and  also 
presence  of  microcracks  in  some  of  the  compression  tested 
specimens . 

In  case  of  the  PREP-HIP  Ti-48Al-lV  material,  the  high 
temperature  deforms^tion  mechanisms  are  seen  as  leading  to  the 
possible  formation  of  the  2  transformed  into  coarse  /3  phase 
in  the  lacy  configuration  at  1350C.  It  should  be  understood 
that  dislocation  interactions,  creation  and  movement  of  dis¬ 
location  boundaries,  and  creation  and  movement  of  transformed 
phase  boundaries  would  be  the  operative  steps  in  these  mechanisms. 
These  microstructural  transformations  appear  to  be  surface 
controlled,  rather  than  bulk  volume  diffusion  controlled,  and 
the  amount  or  volume  fraction  of  the  transformed  phase  could  be 
correlated  to  the  nucleation  and  growth  rates  under  the  dynamic 
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process  conditions.  Such  models  can  be  then  related  to  kinematic 
variables  such  as  temperature  and  strain  rates. 

The  enchanced  effects  of  the  second  phase  could  be  invoked 
by  changing  basic  chemistry  of  the  titanium  aluminide  alloy 
system.  A  possible  study  with  hard  particle  dispersoids  of 
TiB2  was  considered,  but  as  such  materials  were  not  available, 
another  formulation  of  the  base  alloy  system  was  evaluated  in 
form  of  the  CAST-HIP  TiAl  material.  These  CAST-HIP  TiAl 
compression  tested  specimens  have  shown  presence  of  twinning; 
and  in  several  temperature,  strain  rate  conditions  presence  of 
shear  bands.  These  shear  bands  are  seen  to  be  more  pronounced 
in  the  central  regions  of  high  strains  in  these  compression 
tested  CAST-HIP  materials.  These  shear  bands  could  have 
further  acted  as  sites  for  potential  corner  cracks  which  are 
seen  in  these  materials. 

The  CAST-HIP  TiAl  material,  in  the  specimens  analyzed  to 
1175C  show  more  pronounced  non-uniform  microstructural  features 
compared  to  the  PREP-HIP  Ti-48Al-lV  material.  This  suggests 
the  possibility  of  hysteresis  or  memory  effect  related  to  the 
starting  as-received  structures  in  those  CAST-HIP  material. 

In  addition  to  effects  of  shear  bands  which  are  exhibited 
by  these  CAST-HIP  TiAl  materials  these  microstructures  also 
reveal  banding,  interlamellar  structure  and  possible  dynamic 
processing  effects.  As  the  compression  tested  specimens  are 
analyzed  to  1175  C,  no  2^  P  transformation  is  expected.  The 
presence  is  seen  of  the  possible  interlamellar  <<  2  +  phase 
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lath  type  structure.  The  overall  microstructures  in  these 
materials  show  wide  variations  in  the  interior  compared  to 
the  end  regions  which  could  be  due  to  the  inhomogenity  of 
deformation  and  the  processing  conditions  of  the  CAST-HIP 
process . 
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6.0  CONCLUSIONS 


1.  The  present  research  has  evaluated  the  PREP-HIP  Ti-48Al-lV, 

isothermal  compression  tested  from  1000  -  1350  C  for 

several  strain  rate  conditions,  and  the  CAST-HIP  TiAl 

alloy  system  isothermal  compression  tested  from  1000  - 
-  5  -25-1 

1175  C  from  10  ‘  to  10  *  S  strain  rate  conditions, 

for  their  DMM  maps  and  the  microstructural  behavior  of 
these  alloy  systems. 

2.  The  microstructural  dynamics  of  the  PREP-HIP  Ti-48Al-lV 
and  the  CAST-HIP  TiAl  material  in  this  study  indicate  that 
the  presence  of  prior  deformation  structures  and  applied 
thermomechanical  processing  parameters  have  resulted  in 
distinct,  nonuniform  microstructures  and  shear  bands  in 
case  of  the  tested  CAST-HIP  TiAl  material.  However,  for 
the  CAST-HIP  TiAl  material  these  determinations  are 
based  on  limited  available  materials  tested  to  1175  C. 

3.  The  microstructural  analyses  in  case  of  the  PREP-HIP 
Ti-48Al-lV  material  have  exhibited  the  nucleation  and  growth 
of  the  transformed  ^  phase  resulting  with  the  peritectoid 
transformation  at  ^  1285  C.  This  fi  phase  for  compression 
tested  conditions  to  1350  C  is  seen  in  a  lacy  configuration 
appearing  to  envelope  the  grains  in  the  matrix  of  the 
PREP-HIP  Ti-48Al-lV.  This  is  considered  to  be  surface 
controlled  and  the  extent  of  this  growth  transformation 
should  be  dependent  both  on  the  dynamic  nucleation  and 
growth  rates  of  these  new  grains  under  the  dynamic 

process  condition. 
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4.  The  formation  of  metallurgical  stable  structures,  dislocation 

related  subgrain  boundaries,  their  nucleation  and  growth 
rates  and  formation  and  growth  rates  of  ^  /  and  any 

transformed  fh  phases  is  expected  to  be  affected  by  changes 
in  strain  rates  and  temperatures.  For  constant  temperature 
conditions,  the  dislocation  densities,  any  stacking  faults, 
related  interactions  and  phase  transformations  related  to 

the  Ti-Al  system  could  increase  with  increasing  strain 
rates.  These  can  be  evaluated  further  with  transmission 
electron  microscopy  and  should  be  correlated  with 
differential  scanning  calorimetry  to  confirm  these 
dynamic  microstructural  aspects  and  the  nucleation  and 
growth  of  the  transformed  ^  phase  at  ^  1285  C. 

5.  The  metallurgical  synthesis  attempted  in  the  thermo¬ 
dynamically  stable  TiAl  intermetallic  alloy  systems  using 
quantitative  microscopy  methods,  SEM  techniques,  optical 
metallography  with  differential  interference  contrast  DIC 
procedures  has  shown  the  operative  mechanisms  of  dynamic 
recovery,  subgrain  formation,  polygonization  and  dynamic 
recrystallization  in  selected  temperature,  strain  rate 
conditions  in  the  PREP-HIP  Ti-48Al-lV  and  CAST-HIP  TiAl 
material . 

6.  Both  specific  systems  that  are  studied  in  this  research 
have  shown  metallurgical  stability  in  different  temperature, 
strain  rate  conditions.  To  1175  C  compression  test 
conditions,  it  is  seen  that  the  presence  of  oC  2  phase 

is  very  limited,  and  the  hot  deformation  processes  can  be 
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controlled  mainly  by  the  -phase  matrix.  For  the  Ti-48Al-lV 
compression  tested  to  1350  C,  this  system  has  indicated  the 
dynamic  nucleation  and  growth  of  the  phase  related  to 
the  peritectoid  transformation  in  the  Ti-Al  system.  For 
specimens  compression  tested  at  1250  C  and  higher,  the 
presence  of  d.  transformed  ^  is  seen  as  related  to  this 
dynamic  nucleation  and  growth  of  the  P  phase  and  the 
reversion  and  the  peritectoid  transformation  at 1285  C. 

The  structural  evolutions  and  microstructural  aspects  for  stab 
material  flow  conditions  have  been  analyzed  in  the  selected 
titanium  aluminide  alloy  systems. 
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Abstract 


The  mutual  chemical  compatibility  of  some  of  the  potential  matrix  and 
reinforcement  materials  for  ceramic- ceramic  composites  was  analytically 
evaluated  using  the  available  thermodynamic  data.  The  material 
combinations  evaluated  include  A^N-TiBj,  AfN-ZrBj,  AfN-HfBj,  TiC-TiBj, 
TiC-ZrBj,  TiC-HfBj,  TiN-AfjOj,  TiN-ZrQj  and  TiN-HfOj.  In  addition,  the 
chemical  stability  of  TiC,  AfN  and  TiBj  in  a  matrix  of  ^£2^3,  ZrOj  or  HfOj 
was  assessed  experimentally.  Further,  the  oxidation  behavior  of  the 
TiC-A^^^Jjs  TiC-Zrflj,  A^N-A^jOj,  A^-ZrOj,  TiC-TiBj  and  ZrC-ZrBj  composites 
was  also  studied. 


I.  Introduction 


The  mutual  chemical  compatibility  of  the  constituents  of  a 
ceramic  composite  material  is  one  of  the  important  considerations  in 
developing  such  materials  for  high  temperature,  aerospace 
applications.  Considerable  effort  is  therfore  currently  being  devoted 
to  the  study  of  the  mutual  chemical  compatibility  of  potential  matrix 
and  reinforcement  materials. 

The  present  report  describes  the  results  of  analytical 
evaluations  of  chemical  compatibility  of  constituents  of  a  few 
nitride- boride,  carbide- boride  and  nitride- oxide  material  combinations 
and  the  results  of  experimental  studies  on  a  few  nitride- oxide, 
carbide- oxide,  boride- oxide  and  carbide- boride  material  combinations 
which  were  chosen  on  the  basis  of  earlier  screening  studies  and  were 
described  in  the  research  proposal  for  this  project. 

II.  Objectives  of  the  Research  Effort 

The  overall  objective  of  the  effort  was  to  identify  the  pairs  of 
materials  which  would  be  mutually  compatible  in  the  temperature  range 
of  1200-2000°C.  Specifically,  the  objectives  were: 
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(i)  To  evaluate,  analytically,  the  mutual  chemical  compatibility  of 
some  of  the  potential  constituent  materials  for  ceramic 
composites  for  high  temperature  applications.  The  materials 
chosen  for  evaluation  included  ZrBj,  HfBj,  TiBj,  TiC,  A£N,  TiN, 
1^2 O3  ZrOj,  and  HfO,. 

(ii)  To  experimentally  study  the  mutual  chemical  compatibility  of 

constituents  in  the  composites  of  chosen  material  pairs,  namely, 
TiC-A^jOj,  TiC-ZrO,,  TiC-HfOj,  kB-ZiQ^,  A^N-HfOj, 

TiC-TiBj,  ZrC-ZrBj,  TiBj-A^^^^a*  TiBj-ZrOj  and  TiBj-HfOj. 

(iii)  To  study  the  oxidation  behavior  of  A0,  TiC  and  ZrBj  in  the 
TiC-A^2^3j  TiC-ZrOj,  A0-A^2l^3>  A^-ZrOj,  TiC-TiBj,  and 
ZrC-ZrBj  composites 

In  the  following,  the  progress  made  in  each  of  the  above 
areas  will  be  described. 


III.  Analytical  Evaluation  of  Chemical  Compatibility  of  Potential  Matrix 
and  Reinforcement  Phases 

The  analytical  evaluations  have  been  made  on  the  basis  of  the 
results  of  thermodynamic  calculations  making  use  of  the  available 

1—3 

thermodynamic  data.  The  material  pairs  thus  evaluated  include: 
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(1) 


A^-TiBj,  A^N-ZrBj,  A^N-HfBj  (examples  of  nitride- boride 
combinations) 

(ii)  TiC-TiBj,  TiC-ZrB^,  TiC-HfBj,  (examples  of  carbide- boride 
combinations),  and 

(iii)  TiN-A^jOj,  TiN-ZrOj  TiN-HfOj,  (examples  of  nitride- oxide 
combinations) . 

The  results  obtained  for  these  systems  are  described  in  the 

following; 

(a)  A^N  in  a  matrix  of  TiBj,  ZrB^  or  HfBj  (or  TiBj,  ZrBj  or  HfBj 

in  a  matrix  of  AiN) 

Some  of  the  possible  reactions  between  the  matrix  and 
reinforcement  materials  are: 


m  + 

eTiBj 

=  A^IJ  +  TiN  + 

5Ti 

(1) 

A^N  + 

TiBj  = 

kB^  +  TiN 

(2) 

2A^N  - 

K  TiBj 

=  TiA^  +  2BN  + 

kl 

(3) 

A^N  + 

TiBj  = 

:  TiB  +  BN  +  A£ 

(4) 

m  + 

ZrBj  = 

:  ZrN  +  kB^ 

(5) 

A^N  + 

dZrBj 

-  kB  j  2  ZrN  + 

5Zr 

(6) 

kB  + 

m,  = 

:  A^B2+  HfN 

(7) 

kB  + 

eHfBj 

=  A£B^2  BfN  + 

5Hf 

(8) 

The  feasibility  of  the  above  reactions  has  been  evaluated 
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using  the  Gibbs  energy  of  formation  data  for  the  various 
compounds  involved  in  these  reactions.  The  results  of  the 
thermodynamioc  calculations  are  given  in  Table  1.  In  this  table, 
and  in  all  other  tables,  the  subscript  of  AG®  (standard  free 
energy  change)  or  K  (the  equilibrium  constant)  refers  to  the 
reaction  number  for  which  the  value  of  AG®  or  K  is  given.  All 
the  values  of  AG®  are  in  calories. 

Table  1.  Thermodynamic  Data  for  Reactions  (1)  -  (8) 

AG^  or  Temperature  (K) 


1873 

2073 

2273 

agJ 

291,513 

283,732 

275,244 

.35 

9.60  X  10 

-  30 

1.21  X  10 

3.41  X  10' 

0 

AG3 

58,935 

— 

— 

K3 

1.33  X  10' 

— 

— 

ag° 

28,222 

26,188 

24,154 

5.09  X  10*“ 

1.73  X  10*  ^ 

4.76  X  10 

agJ 

338,984 

331,498 

322,304 

79-5 


2.77  X  10 


1.12  X  10 


1.02  X  10 


0 

AGg 


.40 

378,825 
6.21  X  lO"'*'* 


-  35 

374,707 

.  40 

3.11  X  10 


-  3  1 

370,589 

.36 

2.32  X  10 


In  the  above  table,  the  data  for  reactions  (2),  (5)  and  (7) 
have  not  been  included  because  the  phase  is  not  stable  in 
the  temperature  range  1873-2273  K.  Above  1248  K,  it  decomposes 
to  give  A^Bj2  and  a  liquid  Ai-B  phase.  Below  1248  K,  the 
reactions  (2),  (5)  and  (7)  are  not  feasible.  It  must  also  be 
pointed  out  that  the  melting  points  of  the  1ii£  and  k£  phases  in 
reaction  (3)  are  1768  and  933  K,  respectively,  and  the  value  of 
AGj  at  1873  K  has  been  calculated  by  extrapolating  the  data  for 
the  TiA^  phase. 

It  may  be  pointed  out  that  the  error  bars  in  some  of  the 
thermodynamic  data  are  rather  large,  especially  in  the  case  of 
the  data  for  TiB  (±  10  kcal).  However,  the  data  given  in  Table  1 
indicate  that  the  value  of  K  is  the  largest  for  reaction  (4) . 
This  reaction  may  therefore  be  feasible  at  high  temperature. 
These  data  also  indicate  that  A^N  should  be  compatible  with  ZrBj 
and  HfB2. 


(b)  TiC  in  a  matrix  of  TiB2,  ZrB2  or  HfBj 

Some  of  the  possible  chemical  reactions  between  the  matrix 
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and  reinforcement  materials  are: 


TiC  +  3TiBj  =  B4C  +  2TiB  +  2Ti  (9) 
TiC  +  2TiB2  =  B^C  +  3Ti  (10) 
TiC  +  ZrBj  =  ZrC  +  TiBj  (11) 
TiC  +  HfBj  =  HfC  +  TiBj  (12) 


The  thermochemical  calculations  for  the  above  reactions 
yield  the  data  given  in  Table  2. 


Table  2  Thermodynamic  Data  for  Reactions  (9)  -  (12) 


OT 

Temperature  (K) 

1873 

2073 

2273 

AGg 

134,823 

131,582 

128,108 

Kg 

-  l  8 

1.85  X  10 

1.34  X  10'*^ 

4.80  X  10 

148,757 

145,747 

142,313 

^10 

4.38  X  10’ 

-  l  8 

4.30  X  10 

2.07  X  10 

0 

AGu 

4,750 

4,450 

4,150 

Ku 

0.28 

0.34 

0.40 

AGjg 

3,108 

3,316 

3,524 

K,, 

0.44 

0.45 

0.46 

The  above  data  indicate  that  TiC  and  TiBj  should  be 
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mutually  compatible.  However,  in  view  of  the  small  positive 
0  0 

values  of  and  AGjjj  there  is  some  possibility  of  reaction 

between  the  matrix  and  reinforcement  phases,  i.e.,  between  TiC 
and  ZrBj  (or  HfBj). 


(c)  TiN  in  a  matrix  of  ZrQj  or  HfOj 

Some  of  the  possible  chemical  reactions  between  TiN  and 


the  three  oxide  materials  are: 

2TiN  +  =  TijOj  +  2km  (13) 

1 

3TiN  +  =  3Ti0  +  2km  +  (14) 

TiN  +  A^jOj  =  A^jO  +  TiOj  +  jNj  (15) 

TiN  +  ZrOj  =  ZrN  +  TiOj  (16) 

TiN  +  HfOj  =  HfN  +  TiOj  (17) 


Using  the  available  thermodynamic  data  for  the  phases 
involved  in  the  above  reactions,  one  obtains  the  data  given  in 
Table  3. 

Table  3.  Thermodynamic  Data  for  Reactions  (13)  -  (17) 


AG^  or 

Temperature  (K) 

1873 

2073 

2273 

0 

AG, 3 

37,835 

36,677 

* 

^13 

3.84  X 

-  s 

10 

1.36  X  10 

— 

ag;, 

50,396 

* 

* 

Ki4 

1.32  X 

10' ^ 

— 

— 

87,280 

73,328 

* 

Ki5 

6.53  X 

10‘ 

1.86  X  10 

— 
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26,302 

25,952 

* 

8.52  X  10" * 

1.84  X  10'^ 

-■ 

ag;, 

20,812 

20,894 

* 

Ki7 

3.73  X  10'^ 

6.27  X  10'* 

* 

*  TiOj,  TijOj  and  ^TiO  have  melting  points  of  2143,  2115 
and  2023  K,  respectively, 

The  above  data  show  that  TiN  should  be  chemically  compatible 
with  Zrflj  and  HfOj.  The  chemical  and  microstructural 
studies  of  the  A^N-TiOj  system  by  Mocellin  and  Bayer^  showed  that 
at  temperatures  of  1400-1600^0  a  displacement  type  reaction  took 
place  and  that  the  phases  formed  were  TiN  and  This 
confirms  the  results  of  the  thermodynamic  calculations,  shown  in 
the  above  table. 


lY.  Experimental  Studies 

As  stated  earlier,  the  objectives  of  the  experimental  studies 

were  : 

(a)  to  assess  the  mutual  chemical  compatibility  of  the  constituents 
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of  chosen  material  combinations.  These  material  combinations  were 
TiC-A^jOj,  TiC-ZrOj,  TiC-HfOj,  AiN-AijOj,  A/N-ZrOj,  A^N-HfOj, 
TiBj-A^j^sJ  TiBj-ZrOj,  TiBj-HfOj,  TiC-TiBj,  and  ZrC-ZrBj.  The  choice 
of  most  of  these  material  pairs  was  based  upon  thermochemical 
evaluations  of  the  mutual  chemical  compatibility  of  the  materials  in 

,  .  5 

these  pairs  . 

(b)  to  investigate  the  oxidation  behavior  of  TiC  and  A£N,  pure  as 
well  as  embedded  in  the  matrices  of  oxides  (A^2^3>  ZrOj),  and  of 
TiC-TiBj  and  ZrC-ZrBj  composites. 

lY.a  Assessment  of  mutual  chemical  compatibility  of  constituents 
of  the  chosen  material  combinations. 

For  the  purpose  of  obtaining  information  about  their  mutual 
chemical  compatibility,  the  powders  of  the  constituent  materials 
of  the  chosen  combinations  were  intimately  mixed  in  various 
desired  proportions.  The  powders  used  in  the  present 
investigations  were  purchased  from  commercial  sources  (AESAR- 
Johnson- Matthey) .  The  purity  of  the  powders  is  given  below; 
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Material 

A/jOj 

IfOj 

TiBj 

TiC 
ZrB, 

ZrC 
ZrO, 

The  powder  mixtures  were  hot  pressed  into  dense  compacts.  The 
dies  used  were  made  of  graphite,  and  were  lined  with  graphoil. 
Hot  pressing,  in  all  cases  (except  for  pure  AfN) ,  was  carried  out 
in  an  evacuated  chamber  (p~100-250/imflg) .  The  compositions  of  the 
powders/powder  mixtures  that  were  hot  pressed  are  given  in  Table 
4.  This  table  also  gives  the  hot  pressing  temperatures  for 
various  compositions. 

Table  4.  Compositions  of  Powder  Mixtures  Hot  Pressed  and 
The  Hot  Pressing  Temperatures 

Composition  fwtr.)  Maximum  Temperature  f°C)  during 

100  TiC 
lOTiC-AfjOj 

ZOTic-AfjOa 

SOTiC-AfjOj 

35.4TiC-Af203 


Puritv/Grade 

98+7. 

99.997. 

Reactor  grade,  Zr  1.57. 

99.87. 

997. 

977. 

987.,  Hf  1.87. 

99.997. 
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QOTiC-A^jOj 

1980 

20TiC-ZrOj 

2030 

20TiC-IfOj 

2220 

100  km 

1900 

lOA^n-A^jOj 

1740 

20A^N-A^j03 

1600 

44A^N-A^2  03 

1700 

80A^n- A^203 

1820 

90A^N-A^2fl3 

2040 

30Aa-Zr02 

1960, 

2200 

42Aa-Zr02 

1475 

80AiN-Zr02 

2190 

9OAa-Zr02 

2250 

80A^N-Hf02 

2230 

100TiB2 

2370 

20TiB2-A£303 

1800 

20TiB2-Zr02 

2220 

20TiB2-Hf02 

2190 

lOTiC-TiB, 

2310 

20TiC-TiB2 

1875, 

2250 

20TiB2-TiC 

2270 

100  ZrC 

2200 

20ZrC- ZrBj 

1950 

An  interesting  observation  was  made  in  the  course  of  hot 
pressing  of  SOA/N-ZrO^  and  SOA^N-ZrOj  powder  mixtures  at  the 
temperatures  given  in  Table  1.  The  slump  (an  indication  of 
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decrease  in  volume)  was  unusually  large.  When  the  compacts  were 
taken  out  of  the  dies,  they  appeared  to  be  very  thin.  They  had 
golden,  metallic  appearance.  A  large  weight  loss  was  also 
observed.  This  observation  was  not  made  in  the  case  of  the 
42A^N-Zr05  specimen  which  was  hot  pressed  at  a  much  lower 
temperature,  1475° C,  nor  was  it  made  in  the  case  of  the 
QOAO-ZrOj  specimen. 

The  densities  of  hot  pressed  compacts,  as  estimated  from 
their  weights  and  dimensions,  varied  from  ~88  to  987i  of  the 
theoretical  density.  The  theoretical  densities  of  the  composites 
were  calculated  assuming  that  there  was  no  reaction  between  the 
constituent  materials  of  a  composite  during  hot  pressing.  This 
assumption  may  or  may  not  be  valid.  However,  the  validity  of 
this  assumption  could  be  tested  by  characterizing  the  hot  pressed 
specimens  by  x-ray  diffraction  (XRD).  Since  the  composites  were 
made  from  intimate  mixtures  of  powders  by  hot  pressing,  the 
characterization  of  the  composite  specimens  by  XRD  also  yields 
information  regarding  the  new  phases  formed,  if  any,  as  a  result 
of  a  chemical  reaction  between  the  constituents  and  therefore 
about  the  mutual  chemical  compatibility  of  the  constituent 
materials. 
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The  results  obtained  from  UD  of  hot  pressed  specimens  are 
given  in  Table  5. 

Table  5.  Phases  Identified  in  Hot  Pressed  Compacts 


Comoosition  (’wtZl 

Phai?e<?  Identified 

100  TiC 

TiC  (Cubic) 

lOTiC-A^jOj 

TiC,  A^2*^3 

20TiC-A^203 

TiC,  A^203 

30  TiC-A/2^*3 

TiC, 

35.4TiC-A^2^*3 

TiC,  A^j^a 

gOTiC-A^j^a 

TiC,  A^2*^3 

SOTiC-ZrOj 

TiC,  Zr02  (monoclinic) 

20TiC- HfOj 

TiC,  HfOj  (monoclinic) 

AfN 

10AfN-Af203 

A^N  (hexagonal) 

AfN  (hexagonal  + 

cubic),  1^203 

20AfN-A£203 

A^N  (cubic  + 
hexagonal),  A^2^3 

44AfN-A^203 

AfN  (hexagonal  + 
cubic),  A£203 

42AfN-Zr02 

A£N  (hexagonal), 

Zr02  (cubic),  Zr02 

(monoclinic),  ZrN 
(cubic) 

90  AfN-Zr02 

AfN  (hexagonal),  ZrQj 

(cubic),  ZrN  (cubic), 
k  also  aluminum 
oxynitride 

(^^138^287^4 

A^27®39^) 
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SOA^N-Hfflj 

A£N,Hf02 

lOOTiBj 

TiBj 

20TiBj-A^203 

TiBj,  A^j^s 

20TiB2-Zr02 

TiB2,  ZrOj  (mono.) 
TijB,  (?) 

20TiB2-Hf02 

TiBj,  Hf02 

lOTiC-TiBj 

TiB2,  TiC  (very  few 
peaks  of  TiC) 

20TiC-TiB2 

TiBj,  TiC 

20TiB2-TiC 

TiC,  TiBj 

lOOZrC 

ZrC 

20ZrC- ZrBj 

ZrC,  ZrBj 

The  XRD  results  indicate  that  the  two  constituent  compounds  in 
each  of  the  systems  TiC-AfjQj,  TiC-ZrOj,  TiC-HfOj,  A^N-AfjOj, 
A^N-HfOj,  TiBj-A^jOj  TiBj-KfOj,  TiC-TiBj  and  ZrC-ZrBj  are  mutually 
compatible  upto  approximately  the  temperature  at  which  the  respective 
material  combination  was  hot  pressed.  The  constituent  compounds  in 
the  material  combination  A£N-Zr02(and  possibly  also  in  TiBj-ZrOj)  do 
not  appear  to  be  mutually  compatible  as  indicated  by  the  XRD  results 
given  in  Table  5.  In  the  case  of  the  hot- pressed  A^N-ZrOj  specimens, 
the  presence  of  the  cubic  and  monoclinic  zirconia,  aluminum  nitride, 
aluminum  oxynitride  and  zirconium  nitride  phases  was  detected.  The 
"d- values"  reported  in  the  JCPDS  cards  for  the  ZrO  and  ZrN  phases  are 
very  close  to  each  other.  Therefore,  there  can  be  some  question  about 
the  presence  of  ZrN  in  the  hot- pressed  A^-ZrOj  samples.  However,  as 

3 

the  ZrO  phase  is  reported  to  be  gaseous  at  room  temperature  ,  it  was 
concluded 
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that  the  observed  peaks  in  the  XRD  traces  must  be  of  ZrN.  The 
stabilization  of  cubic  zirconia  phase  in  the  A^N-ZrOj  composites 
during  hot  pressing,  as  observed  in  the  present  studies^ is  somewhat 
similar  to  an  earlier  observation  that  cubic  ZrN,  upon  partial 
oxidation,  forms  "amorphous”  cubic  ZrO^  phase^^. 

It  may  be  pointed  out  that  XRD  cannot  detect  the  presence  of  a 
phase  if  its  volume  fraction  is  very  small.  Therefore,  if  any  new 
phases  are  formed  as  a  result  of  a  chemical  reaction  between  the 
constituent  compounds  of  a  material  pair,  and  if  the  rate  of  this 
reaction  is  extremely  slow,  the  reaction  products  may  not  be  detected 
by  XRD  of  the  hot  pressed  specimens.  In  such  cases,  thermal  ageing 
experiment  of  longer  durations  would  be  necessary.  However,  in  view 
of  the  high  temperatures  at  which  the  powder  mixtures  were  hot 
pressed,  the  likelihood  of  a  reaction  being  slow  is  small. 


IV.  b  Oxidation  Studies 


The  study  of  the  oxidation  behavior  of  composites  is  important 
because  the  rate  of  oxidation  of  a  non- oxide  ceramic  may  be  influenced 
by  the  presence  of  other  materials  (oxide  or  non- oxide)  in  the 
composite.  The  oxidation  studies  were  therefore  carried  out  on  the 
hot- pressed  specimens  of  TiC,  A/'N,  TiC-A^2*^3  5  TiC-ZrOj,  TiC-TiBj  and 
ZrC-ZrBj.  A  thermogravimetry  setup  was  used  for  these  studies.  In 
most  cases >  the 
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samples  were  oxidized  in  an  atmosphere  of  flowing  (50-100  m^/min)  air, 
and  in  some  cases,  in  an  atmosphere  of  flowing  pure  oxygen  (technical 
grade).  The  gases  were  passed  over  Drierite  (CaS04)  and  phosphorous 
pentoxide  before  they  entered  the  reaction  tube.  Most  of  the 
experiments  were  carried  out  in  the  temperature  range  ~800-10000C.  In 
the  case  of  the  TiC-TiBj  and  A^N-AijOj  composites,  studies  were  also 
carried  out  at  ~14300C  using  a  thermogravimetric  analysis/ 
differential  thermal  analysis  (TGA/DTA)  apparatus.  The  products  of 
oxidation  were  characterized  using  X-ray  diffraction. 

IV.  b.l  Oxidation  of  the  TiC-A^^Oj  and  TiC-ZrO^  Composites 

The  results  obtained  from  the  oxidation  of  pure  TiC  as  well  as 
the  TiC-A^2^3  composites  of  various  compositions  are  shown  in  Fig. 
1-3.  These  results  show  that  the  oxidation  of  TiC,  pure  as  well  as 
embedded  in  a  matrix  of  alumina,  is  a  strong  function  of  temperature. 
The  plots  in  Fig.  1-3,  show  that  the  rate  of  weight  gain  of  a  sample 
increases  with  increasing  percent  TiC  in  the  composite.  This, 
however,  does  not  imply  that  the  rate  of  oxidation  of  TiC  is  greater 
in  the  composites  containing  larger  weight  fraction  of  titanium 
carbide.  This  is  merely  due  to  the  fact  that  in  the  calculation  of 
the  weight  gain  per  unit  area,  the  surface  area  of  the  entire  sample 
has  been  considered.  It  is  obvious  that  the  surface  area  of  the  TiC 
is  only  a  fraction  of  the  total  surface  area  of  the  sample  and  is 
proportional  to  the  volume  fraction  of  TiC  in  the  composite. 

The  activation  energy  for  the  oxidation  of  TiC  has  been  estimated  to 
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be  59  ±  6  kcal/mole.  This  is  greater  than  the  values  (46-49  ±  2 

6  7 

kcal/mole)  reported  earlier  ’  .  The  value  of  the  activation  energy 
obtained  in  this  work  is  comparable  to  the  activation  energy  for  the 

g 

self  diffusion  of  oxygen  in  TiOj  . 

The  effect  of  the  partial  pressure  of  oxygen  on  the  rate  of 
oxidation  of  TiC  is  shown  in  Fig.  4  and  5.  It  is  found  that  the  rate 
of  oxidation  increases  with  increasing  partial  pressure  of  oxygen. 
This  is  in  agreement  with  the  results  obtained  earlier  by  other 

7  9 

investigators  ’  . 

The  rate  of  oxidation  of  TiC  when  present  in  a  matrix  of  Zr02  was 

found  to  be  several  times  greater  than  its  rates  of  oxidation  when 

embedded  in  a  matrix  of  The  compact  became  very  fragile  and 

sometimes  even  powdery  after  oxidation  for  6  hours.  Fig.  6  shows  the 
results  obtained  from  our  studies.  It  should  be  pointed  out  that  for 
the  calculations  of  weight  gain/unit  area,  the  surface  area  of  the 

entire  sample  has  been  considered.  The  actual  surface  area  will  be 

different  from  this  in  view  of  the  fact  that  only  a  fraction  of  the 
surface  is  covered  by  the  TiC  and  that,  as  a  result  of  rapid 
oxidation,  the  sample  may  be  crumbling  and  thereby  exposing  fresh 
surface  to  the  atmosphere. 

The  oxidized  specimens  were  characterized  by  XRD.  In  the  case  of 
the  TiC-Al202  composites,  the  phases  identified  were  Ti02  (rutile)  and 
AI2O2,  while  those  in  the  case  of  the  TiC-Zr02  composites  were  Ti02 
(rutile)  TiC  and  Zr02  (monoclinic). 
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IV.  b.2  Oxidation  of  the  A/N-AigOj  and  A^N-ZrO;  Composites 

The  oxidation  rates  of  the  AIN-AI2O2  composites  were  found  to  be 
slower  than  those  of  the  TiC-AljOg  composites.  Figs.  7  and  8  show  the 
results  obtained  for  the  various  compositions  studied.  It  can  be  seen 
that  the  weight  gain/unit  area  in  the  case  of  the  composites 
containing  up  to  44  weight  percent  AIN  increases  with  increasing 
weight  fraction  of  AIN.  The  weight  gain/area  is,  however,  greater  for 
the  composite  containing  80  weight  percent  AIN  than  it  is  for  pure 
AIN.  These  results  are  similar  to  those  obtained  for  the  TiC-Al202 
composites,  and  therefore  the  discussion  pertaining  to  the  TiC-Al202 
system  applies  in  this  case,  too. 

The  XRD  results  showed  that  the  phases  present  in  the  oxidized 
specimen  were  AIN  and  AI2O2.  No  surface  cracks  were  detected  upon 
microscopic  (optical)  examination  of  the  oxidized  specimens.  If  one 
calculates  the  ratio  of  the  volume  of  the  AI2O2  formed  to  the  volume 
of  AIN  oxidized,  one  obtains  a  value  of  1.02.  Therefore,  the  change 
in  volume  of  the  composite  as  a  result  of  oxidation  of  AIN  is  very 
small. 

The  rate  of  oxidation  of  the  hot  pressed  composite  made  from  AIN 

and  Zr02  powders  (44  wt  7.  AfN  and  balance  Zr02)  was  very  rapid.  The 

product  of  oxidation  was  in  the  form  of  a  powder.  This,  at  first, 

seemed  surprising  in  view  of  the  fact  that  the  rate  of  oxidation  of 

pure  AIN  and  of  AIN-AI2O2  composites  was  very  slow.  However,  as 

stated  earlier,  the  presence  of  ZrN  and  aluminum  oxynitride  phases  was 

detected  in  these  hot  pressed  composites.  The  oxidation  rates 
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10  11 

of  ZrN  have  been  reported  to  be  very  rapid  ’  .  The  observed  high 
rates  of  oxidation  may,  therefore,  be  due  to  the  presence  of  ZrN  in 
the  composites.  The  results  are  shown  in  Fig.  9. 

The  lED  results  showed  that  the  phases  present  in  the  oxidation 
product  are  AI2O2  and  Zr02  (monoclinic).  It  is  interesting  to  note 
that,  though  the  hot  pressed  specimens  showed  the  presence  of  cubic 
zirconia,  the  cubic  zirconia  phase  was  transformed  to  the  monoclinic 
phase  during  oxidation  of  the  composite.  This  would  seem  to  suggest 
that  the  stabilization  of  the  cubic  phase  may  possibly  be  due  to  the 
incorporation  of  small  amounts  of  nitrogen  in  the  lattice  of  zirconia. 

IV.  b.3  Oxidation  of  TiC-TiB2  and  ZrC-ZrB2  composites 

The  oxidation  behavior  of  TiC-TiB2  composites  was  studied  at 
995®C  and  1430°C.  The  rate  of  oxidation  of  a  20  TiC-TiB2  composite  at 
995°C  as  well  as  at  1430°C  was  faster  than  that  of  pure  TiC  at  the 
same  temperature.  The  results  obtained  from  XRD  of  the  oxidized 
specimens  showed  that  the  product  of  oxidation  was  Ti02.  The  B2O2 
phase  could  not  be  detected.  The  rate  of  vaporization  of  B2O2  above 
1050°C  is  reported  to  be  high,  especially  in  the  presence  of  moisture. 
It  is,  therefore,  surprising  that  it  could  not  be  detected  in  a  sample 
which  was  oxidized  at  995°C  in  dry  air.  Similar  results  were  obtained 
in  the  case  of  the  TiC-TiB2  specimens  oxidized  at  1430°C.  Fig.  10 
shows  the  TGA  results. 
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The  results  obtained  from  the  oxidation  of  pure  ZrC  and  a 
20  ZrC-ZrB2  composite  in  air  at  995°C  are  shown  in  Fig.  11.  It  can  be 
seen  that  the  rate  of  oxidation  of  the  ZrC-ZrB2  specimen  is  slower 
than  that  of  pure  ZrC.  The  only  phase  detected  in  the  oxidized 
specimens  by  IKD  was  monoclinic  zirconia. 


V.  Summary  and  Conclusions 

1.  Analytical  evaluations  based  on  thermodynamic  calculations 
indicate  that 

(i)  AfN  should  be  chemically  compatible  with  a  matrix  of 

¥ 

ZrB2  or  HfB2,  and  possibly  with  TiB2  also. 

(ii)  TiC  is  compatible  with  TiB2,  but  there  exists  the 
possibility  that  it  may  react  with  ZrB2  and  HfB2. 

(iii)  TiN  should  be  compatible  with  a  matrix  of  Zr02 

or  Hf02. 

As  there  is  a  large  uncertainty  in  some  of  the  reported 
thermodynamic  data,  it  is  suggested  that  these  conclusions 
be  verified  experimentally. 

2.  Experimental  results  indicate  that  (i)  TiC  is  compatible 

with  a  matrix  of  ^^®2  ^^®2 

are  compatible  with  a  matrix  of  A^2^3  Hf02-  However,  the 

AfN-Zr02  composites  showed  the  presence  of  new  phases  (cubic 
Zr02,  ZrN  and,  in  some  cases,  aluminum  oxynitride). 
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3.  The  conditions  prevalent  during  hot  pressing  of  AfN  +  Zr02 
powder  mixtures  in  vacuum  seem  to  be  favorable  for  the 
stabilization  of  the  cubic  zirconia  phase  in  the  composite. 

4.  The  rate  of  oxidation  of  TiC  or  AfN  is  much  more  rapid  when 
it  is  embedded  in  a  matrix  of  Zr02  than  when  it  is  embedded 
in  a  matrix  of 

5.  The  rate  of  oxidation  of  a  ZrC-ZrB2  composite  at  995°C  in 
air  is  much  slower  than  the  rate  of  oxidation  of  pure  ZrC 
under  identical  conditions.  The  oxidation  rate  of  a 
TiC-TiB2  composite  is,  however,  faster  than  that  of  pure 
TiC. 
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Fig.l.  Oxidation  of  TiC-  Al„0.,  Composites  in 
air  at  800®C. 
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Fig. 2.  Oxidation  of  TiC  -  AI2OJ  composites  in  air  at  900°C. 
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Fig. 3.  Oxidation  of  TiC-  AI2O2  composites  in  air  at  9950C. 
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Fig. 4. 


Oxidation  of  TiC  at  995 °C  in  Air  and  in  Oxygen. 
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Oxidation  of  30  wt%  TIC-AI2O2  composite  at  995°C 
In  Air  and  In  Oxygen. 
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Fig. 6.  Oxidation  of  a  20  wt%  TiC-ZrO^  composite  in  Air. 
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Oxidation  of  AIN  and  A1N-A1„0,  composites  at 
900’C  in  air. 
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Fig.9.  Oxidation  of  a  42  wt%  AlN-ZrO^  composite  in  air 
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Fig. 10.  Oxidation  of  a  TIC  -  TiB-  composite  at  995  ° C  in 
air. 
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Fig. 11.  Oxidation  of  ZrC  and  a  20  wt%  ZrC-ZrB-  composite 
at  995 '’C  in  air. 
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SUMMARY 


This  research  is  aimed  at  synthesizing  two  diene-yne  compounds  which  are  to  be 
studied  for  their  cyl ization-aromatization  aoility.  The  two  compounds  should  give 
meta-  and  para-terpheny 1  if  aromatization  is  successful.  Future  oevelcpment  of 
these  systems  is  intended  for  the  production  of  thermally  stable  polymers. 

The  diene-yne  l,5-diphenyl-l-hexyne-3-5-diene,  a  compound  that  should  give 
m-terphenyl  when  heated,  has  been  synthesized  by  two  routes.  One  route  converts 
atropaldehyde  to  l-bromo-3-phenyl butadiene  by  a  Wittig  reaction  and  this  product  is 
then  coupled  with  phenyl acetylene.  The  second  route  involves  only  one  step  in 
which  atropaldehyde  reacts  with  the  ylide  based  on  3-bromo-l-phenylpropyne.  The 
second  route  gives  the  diene-yne  in  greater  yield  and  in  purer  form  although 
elemental  analysis  and  mass  spectral  analysis  indicate  the  presence  of  impurities. 

The  synthetic  route  to  l,4-diphenyl-l-hexyne-3, 5-diene  a  compound  that  should 
give  p-terphenyl  when  heated,  has  been  only  partially  completed.  Atropaldehyde  has 
been  successfully  brominated  to  give  2,3-dibromo-2-phenylpropanal .  The 
dehydropr-omination  of  this  compound  is  currently  under  studj. 
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I.  INTRODUCTION 


The  thermal  and  chemical  stability  of  aromatic  systems  has  motivated 
considerable  effort  over  the  years  toward  the  preparation  of  all-aromatic  polymers. 
These  are  polymers  which  contain  no  aliphatic  or  other  non-arylene  linking  units 
along  the  backbone  of  the  polymer  chain.  In  a  sense,  the  epitome  of  these  types  of 
polymers  is  poly  (paraphenylene) ,  I,  as  well  as  its  ortho  and  meta  isomeric  forms. 
Poly  (p-phenylene)  has  been  prepared  by  a  number  of  routes.  For  example,  most 
notably  it  has  been  obtained  from  the  aromatization  of  poly  (1,3-cyclohexadiene)  by 


Marvel  (1),  by  the  coupling  of  benzidinetetrazonium  chloride  (2),  by  the  reaction 
of  1,4-dilithiobenzene  with  titanium,  vanadium  or  cobalt  (3),  and  by  the  oxidative 
coupling  of  benzene  (4);  the  latter  an  innovative  method  developed  by  Kovacic  and 
co-workers. 

Poly  (p-phenylene)  as  generally  prepared  by  the  above  methods  has  been  found 
to  be  a  brown,  insoluble  and  infusible  material  having  fiber-forming  capability. 

It  is  stable  in  air  to  500“C  and  to  700®C  in  an  inert  atmosphere  (4).  Reliable 
molecular  weight  values  have  not  been  obtained  for  these  polymers  owing  to  their 
insolubility. 

Tnere  are  a  number  of  drawbacks  to  the  above  synthetic  routes.  Often  the 
polymers  are  obtained  having  extraneous  elements  (nitrogen,  chlorine)  as  the  result 
of  concurrent  side  reactions,  or  branching  may  occur.  The  mechanical  properties 
have  also  been  found  to  be  disappointing  which  may  be  due  to  incomplete 
arooiatization  or  to  low  molecular  weignt. 
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A  continuing  effort  has  been  put  forward  at  the  Polymer  Branch  of  the 
Materials  Laboratory  at  Wright-Patterson  Air  Force  Base  toward  the  synthesis  of 
polyarylene  polymers.  The  objective  has  been  the  preparation  of  a  pre-polymer  that 
is  soluble  and  processable  and  which  upon  heating  (curing)  would  give  an 
all-aromatic  structure.  An  all-paraphenylene  system  would  give  a  ric.d  rod 
configuration  having  the  potential  to  act  as  internal  reinforcement  as  discrete 
molecules  (molecular  composite  concept).  Certain  proportions  of  meta  or  ortho 
configurations  in  the  structure  might  improve  processability.  For  maximum 
effectiveness  the  pre-polymer  cure  concept  for  preparation  of  arylene  polymers 
demands  certain  requirements: 

1)  No  volatile  substances  should  be  evolved  during  the  curing  process. 

2)  No  catalysts  (especially  metals)  should  be  required  for  the  cure. 

3)  The  curing  reaction  should  involve  as  little  translational  molecular  motion 
as  possible. 

Requirements  1)  and  2)  are  needed  so  that  residual  artifacts  of  the  curing  process 
(gas  bubbles,  metals  from  a  catalyst)  do  not  cause  weakness  in  the  structure  or 
promote  chemical  decomposition.  Reouirenent  3)  is  a  kinetic  one.  It  results  from 
the  fact  that  the  unreacted  portion  of  the  functional  groups  taking  part  in  the 
curing  reaction  become  immobilized  as  curing  progresses.  This  can  be  compensated 
for  only  by  increasing  the  temperature  of  the  curing  process  so  that  chain 
translational  mobility  is  increased.  The  result  of  this  is  that  the  use 
temperature  of  the  polyme*"  (class  temperature)  can  be  no  higher  than  the  maximum 
cure  temperature  employed. 

In  order  to  minimize  the  translational  motion  required  in  the  curing  process, 
the  concept  at  the  WP  Polymer  Branch  has  been  to  investigate  systems  which  possess 
reacting  groups  located  in  close  proximity  to  each  other.  These  groups  are  to  be 
designed  to  yield  aromatic  structures  by  an  intramolecular  cyclization  process. 
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Some  success  has  been  achieved  (5)  with  2,2'-bis (phenylethynyl )  biaryl  systems  such 
as  II.  These  have  been  found  to  cyclize  upon  heating  to  give  dibenzoanthracene 
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structures  (III).  When  the  diethynyl  biaryls  were  incorporated  into  the  backbone 
of  poly (phenyl quinoxalines) ,  it  was  possible  to  obtain  a  cured  polymer  having  a 
glass  temperature  at  least  one  hundred  degrees  higher  than  the  cure  temperature. 

Continued  effort  along  these  lines  has  been  aimed  at  synthesizing  certain 
model  diene-yne  systems  which  could  be  examined  for  intramolecular  cyclization 
ability.  During  the  summer  of  1986,  the  writer  was  engaged  at  WPAFB  in  attempts  to 
prepare  compounds  IV,  V,  and  VI  which  were  intended  for  study  of 
cyclization-aromatization.  It  is  hoped  that  these  compounds  will  form  the  aromatic 
structures  shown  in  equations  2,  3,  and  4.  Tne  approaches  taken  to  prepare  these 
compounds  are  summarized  in  schemes  i  and  2. 
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Also,  it  was  not  possible  to  prepare  the  acetylenic  ketone,  IX.  In  Scheme  2, 
viny'phenyl  ketone,  XV,  proved  to  be  so  susceptible  to  polymerization  that  it  was 
not  possible  to  carry  out  its  conversion  to  the  bromodiene,  XVI,  even  when 
performing  the  Wittig  reaction  at  -78°. 

Ketones  behave  in  a  much  more  sluggish  manner  in  Wittig  reactions  tn-^n  do 
aldehyaes.  Also,  the  basic  nature  of  the  ylide  used  in  the  Wittig  synthesis  opens 
a  possibility  for  side  reactions  that  become  more  likely  vis-a-vis  the  desired 
reaction  with  the  ketone  group.  The  proposed  research  is  directed  at  preparing 
compounds  IV  and  VI  by  routes  involving  Wittig  reactions  with  aldehydes  rather  than 
ketones . 

II.  APPROACH 

The  synthesis  of  the  diene-yne  compounds  IV  and  VI  was  attempted  utilizing  the 

6-unsaturated  aldehyde,  atropaldehyde  as  the  starting  material  (Schemes  3,  4, 
and  5],  This  aldehyde  is  a  white  solid  which  tends  to  decompose  (polymerize  ?)  if 
kept  for  long  times  (>  4  hr.)  at  room  temperature,  but  it  can  be  stored  up  to  three 
weeks  at  -20°.  Atropaldehyde  can  be  prepared,  as  needed,  utilizing  the  procedure 
of  Crossland  (6)  (see  Scheme  6). 

The  preparation  of  compound  IV  followed  two  paths  as  shown  in  Schemes  3  and  5. 
Scheme  3  begins  with  atropaldehyde  and  in  two  steps  yields  the  desired  diene-yne. 
The  first  step  is  a  Wittig  reaction  with  bromometnylenephosphorane  (ylide)  to  give 
the  bromodiene  XVII.  This  is  then  coupled  with  phenyl  acetylene  using  a  procedure 
developed  at  WPAFB  (7)  to  give  diene-yne.  Tne  second  path,  shown  in  Scheme  5, 
involves  a  Wittig  reaction  between  atropaldehyde  and  the  ylide  obtained  from 
3-bromo-l-phenylpropyne  (XXII  and  XXIII), 

Tne  approach  to  compound  VI  is  shown  in  Scheme  4.  This  involves  the 
p’-eparation  o-^  bromodiene  XX  by  bromination  of  atr-opaldehyde  followed  by 
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dehydrobromination  and  a  Wittig  reaction.  The  bromodiene  is  then  coupled  with 
phenyl  acetylene  as  in  Scheme  3. 

Some  of  the  intermeciate  and  products  occurring  in  these  procedures  contain 
double  bonds  capable  of  existing  in  two  geometric  (cis-trans)  forms.  In  these 
preliminary  experiments  no  attempt  is  made  to  determine  stereochemical  identity  or 
to  perform  stereospecific  reactions. 

III.  EXPErllMENTAL  PROCEDURES 

Atropaldehyde :  The  procedure  of  Crossland  (6)  was  followed  without  major 
modification  (Scheme  6).  Infrared  and  proton  NMR  spectra  of  intermediates  and 
products  were  consistent  with  the  expected  structures.  The  last  step,  the 
hydrolysis  of  the  acetal  with  formic  acid/water,  had  to  be  done  quickly  and  all 
reagents  had  to  be  pre-chilled  before  use.  Crystallization  of  the  atropaldehyde 
from  diethyl  ether/petroleum  ether  was  accomplished  using  a  chloroform  slush  bath 
(-63®).  The  white,  crystalline  solid  melted  at  33-38®  (Lit.:  38-40®).  Samples  of 
atropaldehyde  were  prepared  in  8-10  g  batches  which  were  then  stored  at  -18®  until 
used  (storage  generally  not  longer  than  two  weeks). 

Triphenylbromomethylphosphonium  bromide:  This  salt  was  used  in  Scheme  3  to 
prepare  bromodiene  XVII.  The  procedure  of  Wolinsky  and  E'^ickson  was  followed  (8). 
Dibromomethane  (45.9  g,  0.264  moles),  triphenylphosphine  (30.1  g,  0.115  moles)  and 
250  mL  dry  toluene  were  refluxed  for  three  hours.  The  mixture  was  cooled  and 
pou-^ed  off  from  the  small  amount  of  dark  brown  oil  that  formed.  The  suoernetant 
was  then  refluxed  for  an  additional  21  hours,  cooled  to  room  temperature  and  the 


solid  isolated  by  filtration.  A  second  crop  of  solid  could  be  obtained  by  an 
additional  24  hour  reflux.  The  combined  solids  were  pumped  for  4  hours  at  48®  to 
give  21.7  g  of  tan  solid  (435),  mp  227-234'®  (Lit.:  240-241),  Recrystalliaation 
using  acetone/water  or  isopropyl  alcohol  gave  crystals  with  mp  238-243®  and 
237-244“  respectively. 
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Preparation  of  Bromodiene  XVII,  Scheme  3:  In  a  glove  box  (argon  blanket)  the 
following  were  prepared.  Mixture  A:  this  is  a  suspension  of  28.3  g  (0.065  moles) 
of  triphenylbromometh^lphosphonium  bromide  in  50  mL  dry  THF  placed  in  a  250  mL 
three-neck  flask  along  with  a  Teflon-coated  stir  bar.  The  necks  of  the  flask  were 
sealed  with  rubber  serum  caps.  Mixture  B:  this  is  a  solution  of  7.28  g  of 
potassium  t-butoxide  (0.065  moles)  in  50  mL  dry  THF  placed  in  a  dropping  funnel. 
Mixture  C:  is  a  solution  of  8.0  g  (0.060  moles)  of  atropaldehyde  in  50  mL  dry  THF 
placed  in  a  second  dropping  funnel.  All  containers  were  sealed  with  rubber  septa 
and  removed  from  the  glove  box.  Using  argon  purging  to  minimize  exposure  to  air, 
the  dropping  funnels  (mixture  B  and  C)  were  attached  to  the  three-neck  flask 
(mixture  A).  The  flask  was  cooled  to  -78®  using  a  dry  ice-isopropanol  bath  while 
stirring  and  maintaining  a  slow  argon  purge.  Mixture  B  was  added  slowly  to  A  over 
a  30  min.  period  while  stirring  and  maintaining  argon  purge  and  a  temperature  of 
-78®.  The  mixture  was  allowed  to  stir  for  an  additional  30  min.  (-78®).  This 
procedure  is  similar  to  that  of  Matsumoto  and  Kuroda  (9).  The  atropaldehyde 
solution  (mixture  C)  was  then  added  over  30  min.,  the  mixture  was  allowed  to  stir 
an  additional  50  min.  and  then  vacuum  filtered  quickly  through  a  large  (150  mL) 
medium  frit  funnel  (rinsed  with  3x10  mL  HPLC  grade  THF),  Dry,  filtered  solid  mass 
12.15  g. 

Tne  filtrate  was  poured,  with  stirring  into  400  mL  petroleum,  ether.  A  light 
brown  precipitate  (in  some  exoeriments  a  brown  sticky  paste  formed)  formed  which 
was  filtered  off  (d»-y  solid  mass:  16,50  g).  The  solvent  in  the  filtrate  was 
removed  by  rotary  evaporation  to  give  a  dark  amber  oil  (11.25  g).  To  this  was 
added  50  mL  petroleum  ether  which  induced  the  formation  of  a  bright  yellow 
precipitate  which  was  removed  by  filtration.  The  filtrate  was  again  concentrated 
to  4-25  g  of  a  cloudy-yellow  oil. 
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The  yellow  oil  was  kneaded  with  5.0  g  powdered  AgN03  petroleum  ether 

(removes  unreacted  atropaldenyae)  for  5-10  min.  and  using  suction,  the  mixture  was 
run  through  a  short  (1.5  cm)  column  of  silica  gel  using  elution  by  petroleum  ether. 
Twenty  ml  of  eluate  were  collected  and  this  was  vacuutti  concentrated  to  give  1.45  g 
of  a  nearly  colorless  (slightly  yellow)  oil.  A  second  AgN03/siliCc  gel  column 
treatment  yielded  0.70  g  of  the  oil. 

The  proton  NMR  of  the  oil  is  consistent  with  the  structure^  for  the  bromodiene 
XVII.  The  spectrum  is  shown  in  Figure  1.  The  principal  absorptions  are:  7.1  C 
(singlet,  5H) ,  6.6  6  (doublet,  0.8  H) ,  6.3  6  (doublet,  0.8  H) ,  5.5  6  (doublet, 

H) .  The  infrared  spectrum  showed  absorptions  at  3090  cm"^  (olefinic  C-H) ,  3040 
cm~^  (aromtic  C-H)  anc  900  cm~^  (terminal  «  CH2).  This  spectrum  is  shown  in  Figure 
2. 

Preparation  of  Diene-Yne  IV  by  Scheme  3:  Into  a  25  ml  three  neck  flask  were 
placed;  0.0469  g  Bis (triphenylphosphine)  palladium( II)  chloride,  0.0217  g  cuprous 
iodide,  10  ml  petroleum  ether,  5  ml  triethylamine  (previously  distilled  from  and 
stored  over  CaH2)  and  a  magnetic  stir  bar.  The  necks  of  the  flask  were  sealed  with 
rubber  septa  and  the  mixture  was  purged  for  15  min.  with  argon.  (Syringe  needles 
used  for  argon  inlet  and  exit).  To  the  stirred  mixture,  0.343  g  (3.36  mmoles)  of 
pnenyl acetylene  (vacuum  distilled  from  and  stored  over  Cafl2)  was  added  by  syringe. 
The  mixture  turned  black  and  a  black  sticky  solid  edherec  to  the  bottom  of  the 
flask.  One  neck  of  the  flask  was  briefly  opened  (argon  purge)  to  allow  breaking  up 
tne  solid  wicn  a  spatula.  To  tnis  mixture  was  added  0.616  g  (2.94  imoles)  of 
bromodiene  XVII  (syringe  injection  petroleum  ether  rinse).  The  necks  of  the  flask 
were  stoppered  and  the  mixture  allowed  to  stir  at  room  temperature  for  50  hours 
(flask  ctjvered  by  black  cloth). 

Tne  oark  Drown  reaction  mixture  was  diluted  witn  lu  mL  metnylene  chlorioe  and 
run  through  a  short  (1.5  cm,  4.0  g)  silica  gel  column  using  15  mL  methylene 
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chloride  as  rinse.  The  eluate  was  concentrated  b>  rota-^^  evaporation,  additional 
methvlene  chloride  was  adoed  and  again  evaporated  (twice  more)  in  order  to  remove 
most  of  the  triethylamine .  The  residue  remaining  (a  dark  brown  tar)  had  a  mass  of 
0.92  g.  Tr 's  material  was  transferred  to  a  1.5  x  10  cm  column  of  silica  gel  using 
a  minimum  amount  of  methylene  chloride  (2,  2,  1  ml  rinses)  and  elution  continued 
with  this  solvent.  A  dark  brown  band  eluted  between  6  and  15  ml  of  elution  volume. 
A  total  of  25  mL  was  collected  and  this  was  evaporated  and  then  pumped  at  0.5  mm  at 
room  temperature  for  two  hours  leaving  0.75  g  of  residue.  The  proton  NMR  of  this 
material  showed  at  least  three  large,  overlapping  peaks  at  7.3  6  (aromatic  H)  and 
many  small  peaks  between  1  and  3  6  (aliphatic  H)  and  between  5  and  7  6  (olefinic 
H). 

An  attempt  was  made  to  purify  this  product  by  selective  solubility.  It  was 
found  that  a  30/70  THF/methanol  mixture  caused  the  precipitation  of  some  dark  brown 
material.  Slow  gravity  filtration  through  a  medium  frit  funnel  and  evaporation  of 
the  solvent  (pumping  at  0.5  mm  4  1/2  hrs.)  gave  0.485  g  of  red-brown  tar.  The 
THF/methanol  solid  was  similarly  dried  to  give  0.052  g  of  brown  tar.  This  latter 
material  was  not  examined  further. 

The  NMR  and  FTIR  of  the  soluble  portion  are  shown  in  Figure  3  and  4.  Samples 
of  this  material  were  sent  for  C  and  H  analysis,  mass  spectral  analysis  and  DSC 
scan  (see  Results  and  Discussion). 

Preoeretion  of  Diene-Yne  IV  by  Scheme  5: 

A.  Preparation  of  3-bromo-I-phenylpropyne  (XXII):  In  a  250  mt  th’‘ee  neck  flask 
were  placed  25  g  (0.19  moles)  of  coninercial  3-phenyl -2-propyn-l-ol ,  90  ml  diethyl 
ether,  10  ml  pyridine,  and  a  magn€tic  stir  bar.  To  this  stirred  solution  was  added 
over  1  1/2  hours,  a  solution  of  25.65  g  (0.095  moles)  phosphorous  tribromide  in  25 
mL  diethyl  ethe^'.  The  mixtu-e  was  allw<#ed  to  sti'^  at  ’•oom  temperature  for  two 
hours  and  then  refluxed  for  two  hours.  After  cooling,  the  solution  was  poured  into 
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500  mL  ice  water.  The  layers  were  separated.  The  water  layer  was  given  two  washes 
with  50  ml  ether.  The  ether  layers  were  combined  and  given  three  washes  with  2% 
aqueous  sodium  carbonate.  The  organic  layer  was  dried  with  magnesium  sulfate  and 
then  concentrated  by  rotary  evaporation  to  give  19.86,  of  clear,  nearly  colorless 
liquid  (501  yield).  The  proton  NMR  of  the  liquid  is  consistent  with  the  structure 
of  3-bromo-l-phenylpropyne;  singlet  at  7.4  6  (aromatic  K,  5H) ,  singlet  at  4.2  6 
(-CH2Br,  2H) .  (See  Figure  5). 

B.  Preparation  of  Triphenylphosphonium  salt  of  3-bromo-l-phenj*  Ipropyne 
(precursor  to  ylide  in  Scheme  5):  Into  a  500  ml  one  neck  flask  were  placed  12.0  g 
(C.06  moles)  of  the  bromo  compound  prepared  in  Part  A  above,  21.0  g  (0.08  moles)  of 
tripheny' p-:csphine ,  175  mL  dry  toluene  and  a  magnetic  stir  bar.  The  mixture  was 
stirred  and  refluxed  for  4  hours.  The  flask  was  stoppered  and  refrigerated 
overnight.  To  the  mixture  was  added,  with  stirring,  70  mL  cyclohexane.  The  slurry 
was  run  through  a  coarse  frit  funnel  (slow  gravity  filtration)  and  the  solid  rinsed 
with  2  X  50  mL  cyclohexane.  The  solid  was  then  sucked  dry  by  vacuum  and  then 
pumped  at  0.5  mm  30-40®  for  4  hours  and  then  21/2  hours  at  50®.  Mass  of  the  tan 
salt  was  28.15  g  (100%  yield).  The  melting  point  found  was  200-210®  (192®  reported 
by  WPAFB  technician). 

C.  Preparation  of  diene-yne  IV,  Scheme  5:  Into  a  250  mL  three  neck  flask  were 
placed  13.26  g  (0.029  moles)  of  the  phcsphonium  salt  of  part  B  above,  100  mL  dry 
THf,  and  a  magnetic  stir  bar.  An  argon  inlet  was  attached  and  the  remaining  necks 
were  sealed  with  rubber  septa  (a  syringe  needle  used  as  argon  ex'.t).  Tne  mixture 
was  stirred  and  cooled  to  0®  and  then  12  mL  of  a  2.5  M  solution  of  n-butyll ithium 
in  hexanes  (C.C3  moles)  was  added  by  syringe  over  35  minutes.  Ttie  mixture  turned  a 
very  dark  red  (almost  black).  The  ice  bath  was  removed  and  the  mixture  was  allowed 
to  warm  to  room  temperature  over  25  rnin.  Tne  ice  oam  was  replacec  anc  into  one- 
neck  of  the  flask  was  placed  a  dropping  funnel  containing  3.3  g  (0.025  moles)  of 
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atropaldetivde  in  20  mL  ar^'  THF.  Tnis  solution  was  added  over  25  min.  and  then  the 
mixture  allowed  to  stir  at  0“  for  1  1/2  hrs,  and  then  at  room  temperature  for  4 
hrs . 


The  mixture  was  poured  with  st'-ring  into  350  ml  "hexanes"  to  give  a  large 
lump  of  sticky’  material.  The  liquid  phase  was  run  through  a  large,  coarse  frit 
funnel.  The  mass  of  the  air  dried  sticky  solid  was  15.4  g  and  this  material  was 
not  examined  further.  The  above  "^iltrate  was  concent’^ated  by  rotary  evaporation  to 
give  3.75  g  of  a  thick,  red,  tarry  residue. 

Partial  purification  of  this  material  was  accomplished  by  two  column 
chromatography  procedures.  The  first  chromatography  was  performed  on  a  2.5  x  4  cm 
silica  gel  column  and  the  material  collected  in  300  ml  of  35/65  cyclohexane/CH2Cl 2 
gave  2.64  g  of  residue.  This,  in  turn  was  run  through  a  short  (2.5  x  2.5  cm) 
column  of  silica  gel,  using  first  cyclohexane  as  solvent  (200  ml  eluted)  and  then  a 
50/50  cyclohexane/CH2Cl2  mixture  (300  ml  eluted).  The  cyclohexane  elution  gave 
approximately  0.5  g  of  dark  brown  tar  (not  examined  further)  and  the  solvent 
mixture  elution  gave  1.30  g  of  a  yellow,  crusty  solid  (232).  The  proton  NMR, 

NMR  and  FTIR  of  this  material  are  shown  in  Figures  6,  7,  and  8.  Elemental  analysis 
mass  spectrometry  and  differential  scanning  calorimetry  were  also  perfomred  (see 
Results  and  Discussion). 


Preoeretior  of  Dib^omoetropclcehyGe  fcompounc  XVIII,  Sheme  4):  Into  a  100  mL 
flask  were  placed  7.15  g  (C.054  mcles)  etropeldehyde,  25  mL  COI4  and  a  magnetic 
stir  bar.  A  drot^pinc  funnel  containing  £.95  c  (C.056  moles)  bromine  in  25  ml  ZZ'4 
was  attached  to  the  flask.  The  flask  was  cooled  to  0®  and  while  stirring  the 
mixture,  the  bromine  solution  was  added  over  40  min.  The  mixture  was  stirred  for 
30  min.  and  allowed  to  warm  to  room  temperature.  The  red  reaction  mixture  was 
washed  with  3  x  30  mL  202  .NaHS03  solution,  3  x  30  mL  saturated  NaHC03  solution,  and 
2  X  50  mL  water.  The  organic  layer  was  dried  with  Na2S04  followed  by  MgS04  and 
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then  rotary  evaporated  to  give  12.03  g  (761)  pale  yellow  liquid.  The  proton  NMR 
and  FTIR  of  this  material  are  shown  in  Figures  9  and  10.  Elementol  analysis  was 
obtained . 

Attempted  Preparation  of  Bromoaldehyde  XIX  (Scheme  4):  A  procedure  similar  to 
that  of  Bailey  and  Feng  (10)  was  followed  in  which 

l,8-dia2abicyclo(5,4,0)undec-7-ene  (DBU)  was  used  as  the  base.  Into  a  250  ml  3 
neck  flask  were  placed  10.0  g  (0.0342  moles)  of  the  dibromoatropalaehyde  (XVIII), 

40  ml  anhydrous  diethyl  ether  and  a  magnetic  stir  bar.  To  this  were  attached  an 
argon  inlet  and  exhaust  and  a  dropping  funnel  containing  5.21  g  (0.0342  moles)  DBU 
in  80  ml  anhydrous  ether.  The  flask  was  cooled  to  0®  and  stirred  while  the  DBU 
solution  was  added  over  2  hrs.  The  mixture  was  then  allowed  to  stir  and  warm  to 
room  temperature  for  an  additional  1  1/2  hrs.  The  mixture  was  run  through  a  medium 
frit  funnel  (ether  rinse)  and  the  filtrate  rotary  evaporated  to  give  2.9  g  of 
residue  (405  crude  yield).  The  protoh  NMR  spectrum  of  this  material  is  shown  in 
Figure  11. 

IV.  RESULTS  AND  DISCUSSION 

A.  Preparation  of  diene-yne,  IV,  by  Scheme  3:  The  synthesis  of  the 
bromociene,  XVII,  proved  to  be  tne  mosi  difficult  step  in  the  scheme.  Numerous 
attempts  were  made  using  various  modifications  of  tne  Wittig  procedure.  The  most 
conspicuous  and  detrimental  feature  of  this  step  was  tne  tendency  of  atropaloehyoe 
to  polymerize.  This  could  be  seen  in  the  large  amounts  of  sticky,  brown  materiel 
which  precipitated  when  the  filtered  reaction  mixture  was  poured  into  petroleum 
ether.  Generally  tne  mass  of  tnis  material  was  greater  than  the  amount  of 
tri phenyl phosphirve  oxide  expected  from  the  reaction.  Evidently  the  basicity  of  the 
ylide  was  sufficient  to  initiate  the  anionic  polymerization  of  atropaldehyde. 

In  Bd0'>tio'n  to  tne  above  handicap,  some  atropaloenyoe  remained  unt*eacted  and 
appeared  in  the  crude  product.  This  last  trace  of  aldehyde  could  be  removed  by  the 
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silver  nitrate  treatment  (see  Experimental  Procedure).  The  final  purified  product 
was  generall)f  obtained  as  a  colorless  oil  in  sniall  yields  of  5-lOt,  however,  the 
proton  NMR  and  infrared  spectra  agree  quite  well  with  the  structure  of  XVII 
(Figures  1  and  2).  Elemental  analysis  on  this  compound  was  not  obtained. 

The  coupling  reaction  of  the  bromodiene  with  phenyl  acetylene  catalyzed  by  the 
palladium  complex  and  cuprous  iodide  gave  71t  crude  product.  This  material  is  a 
reddish-amber  paste.  The  proton  NMR  (Figure  3)  shows  one  large  peak  (overlapping 
peaks)  at  7. A  C  (aromatic  protons)  a  very  weak  absorption  at  6.5  6  (olefinic 
protons)  and  several  small  peaks  between  2  and  4  6  probably  due  to  impurities. 
Olefinic  proton  peaks  could  not  be  discerned.  The  infrared  spectrum  (Figure  4) 
shows  the  expected  absorption  between  3000  and  3100  cm"l  for  sp^  C-H  stretch,  a 
weak  (broad)  absorption  near  2280  cm"^  probably  due  to  -CEC-  stretch,  a  moderate 
absorption  of  912  cm""^  likely  due  to  -CH2,  and  certain  other  features  (-C-C- 
stretch  near  1600  cm  and  aromatic  bend  at  753  and  693  cm“^)  consistent  with  the 
structures . 

The  mass  spectrogram  of  this  material  shows  a  great  deal  of 
l,i-di phenyl hexadiyne (mass  202  relative  intensity  -  lOCi)  and  some  of  the  desired 
diene-yne,  IV  (mass  230,  RA  -  2.165).  Tne  diyne  is  a  oy-proauct  known  to  oe 
created  when  phenyl  acetylene  interacts  in  a  rediictive  coupling  reaction  with 
palladium  compounds  (11).  Tne  elemental  analysis  showed:  5C,  82.24,  82.2^;  IH, 
5.50,  5.52  (Theoretical  5  C:  92.8;  58:6.1). 

Thus,  the  synthetic  route  Depicted  in  Scheme  3  is  capable  of  giving  toe 
desired  diene-yne,  however,  the  overall  yield  is  quite  low  and  the  final  material 
is  not  pure.  The  synthetic  steps  would  have  to  be  greatly  increased  in  scale  in 
order  to  obtain  enough  material  to  explore  various  purifi cat ion /separation 
procEOures. 


80-19 


One  additional  aspect  of  Scheme  3  is  that  the  Pd/Cu  catalyzed  coupling  step 
may  result  in  metal  residues  occurring  in  the  final  product.  It  was  oPserved  in 
this  study  that  solutions  of  the  material  in  deuterated  chloroform  (NMR  samples) 
tended  to  give  a  small  amount  of  fine,  yellow  precipitate  on  standing  overnight. 

No  attempt  was  made  to  analyze  this  material  and  identify  it  as  metallic,  however, 
all  organic  products  (including  by-products  and  impurities)  were  quite  soluble  in 
chloroform.  If  such  metallic  materials  were  to  occur  in  the  polymeric  analog  of 
the  diene-yne,  they  might  result  in  certain  adverse  consequences.  For  example, 
they  may  catalyze  oxidative  degradation  of  the  polymer  at  high  temperature. 

E.  Preparation  of  diene-yne,  IV,  fay  Scheme  5:  This  is  a  relatively  simple 
procedure  that  can  produce  IV  in  one  step.  The  disadvantage  of  this  procedu-^e  is 
that  the  key  step  is  a  Wittig  reaction  which  is  notoriously  poor  in  giving  products 
in  high  yields.  If  used  as  a  polycondensation  reaction,  this  approach  would  not 
easily  give  rise  to  high  molecular  weight  material.  Nevertheless,  the  scheme  does 
not  utilize  heavy  metals  and  its  simplicity  makes  it  an  attractive  alternative  to 
Scheme  3. 


Compared  to  the  procedure  in  Scheme  3,  the  one  step  procedure  of  Scheme  5 
gives  the  final  product  in  four  times  the  quantity  (based  on  atropal dehyae) . 
FurtherTnore,  the  final  product  appears  to  be  a  more  pure  form  of  tne  diene-yne. 

The  elemental  analysis  gave:  5C,  85.21,  85.50;  SK,  5.52,  5.95  (Thee:  SC,  52.8,  SH 
£.1).  Tne  FTIR  spiectrum  (Figure  8)  gives  absorptions  wr.icn  ere  in  agreement  witn 
the  structure  of  IV  (3000-3100  ctr"^  for  sp^  C-H  streten,  2247  cm"-^  for  -CEC- 
stretch,  1643  cm~^  for  -C-C  stretch  and  912  cm"^  for  -CH2  bend)  however,  there  are 
some  absorptions  that  seem  to  be  due  to  contaminants  (2931  emr^  sp^  C-Fl  stretch  and 
1722  cm"^  for  C*0  stretch).  The  proton  NMR  spectrum  (Figure  6)  is  similar  to  tnat 
of  tne  prtxiuct  of  Scr.eme  3.  This  snows  a  ve^y  strong  aromatic  proton  absorption 
(several  overlapping  peaks)  at  7.5  6  but  very  weak  absorptions  in  the  5-7  6  region 
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where  the  olefinic  hydrogens  are  expected.  One  explanation  (though,  perhaps  not 
valid)  for  the  apparent  lack  of  significant  olefin  hydrogen  absorption  is  that 
these  hydrogens  are  experiencing  aromatic  ring  current  deshielding  effect  that 
causes  them  to  absorb  in  the  aromatic  region.  Some  peaks  in  the  6-7  6  region  of 
the  spectrum  indeed  seem  to  merge  with  the  strong  aromatic  peak.  The  NMR 
spectrum  (Figure  7)  likewise  shows  primarily  only  the  absorption  due  to  aromatic 
carbons  in  the  120-130  ppm  region.  A  small  negative  absorption  is  seen  at  85  ppm 
which  is  in  the  region  for  acetylenic  carbon  absorption. 

Thin-layer  chromatography  of  this  material  (20/80  cyclohexane/chloroform  on 
silica  gel)  shows  what  appears  to  be  two  intensly  fluorescent  overlapping  spots 
(cis-trans  isomers?)  at  Rf  -  0.7  plus  two,  very  faint  spot:  ,  t  Rf  »  0.41  and  0.32. 
There  was  a  slight  tailing  pattern  behind  the  spots  which  may  suggest  some  reaction 
occurring  between  the  compound  and  the  silica  gel. 

The  mass  spectrum  for  this  compound  shows  a  relative  intensity  of  25.78 
(comparer  to  CgHs'*’  RA  -  100)  for  mass  230  corresponding  to  the  diene-yne. 

Actually  mass  230  gives  the  most  intense  peak  above  200. 

C.  Differential  scanning  calorimetry  of  diene-yne  samples  prepared  by  Schemes 
3  and  5:  DSC  information  on  the  diene-ynes  should  be  very  useful  in  determining 
whether  cyclization  is  taking  place.  The  DSC  scan  for  the  Scnene  3  product 
(designated  as  sample  RP77-2)  is  shown  in  Figure  12-  This  shows  a  slight  endotherm 
at  63. 2*^  and  then  a  la^-ge,  broad  exotherm  at  100-250*’.  A  second  scan  of  this 
material  (first  scan  taken  to  450®}  gives  essen-tially  a  flat  line  with  a  very 
slight  endothermic  trar.sitiofi  at  214®.  No  enootnerm  was  observed  at  66-£7'®,  tne 
melting  region  for  m-terphenyl  which  is  tne  expected  product  of  cyclization. 

The  DSC  scan  for  the  Scheme  5  product  (designated  as  sample  RP8D-1)  is  shown 
in  Figure  15.  A  ratner  different  behavior  is  observed  in  comparison  to  the  Scheme 
5  product.  A  weak  enootherm  appears  at  53®  which  is  inmediately  followed  by  a 
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Droad  exotherm  from  90  to  170®  and  another  very  broad  exotherm  from  170  to  366®. 
The  second  scan  shows  no  discernible  transition;  a  disappointment  in  that  the 
melting  point  for  m-terphen^l  was  not  detected. 

Continued,  more  careful  DSC  work  on  this  sample  is  to  be  carried  out.  It  may 
be  that  since  the  first  DSC  scan  was  carried  out  to  450®,  the  m-terphenyl  which 
formed  may  have  decomposed  or  sublimed  out  of  the  sample  holoe’".  It  has  been 
suggested  to  the  chemists  at  WPAFB  that  the  first  DSC  scan  be  carried  out  to  no 
higher  than  3D0®  before  running  the  second  scan. 

D.  Synthesis  of  diene-yne  VI  (Scheme  4):  This  sequence  of  steps  is  only 
partially  complete.  Atropaldehyde  has  been  successfully  brominated  to  give 
2,3-dibromo-2-phenylpropanal  (compound  XVIII)  in  fair  yield.  The  proton  NMR  and 
FTIR  of  this  material  (Figures  9  and  10)  are  consistent  with  the  structure. 
Elemental  analysis  gave  38.88%  C,  2.93%;  H,  49.75%  Br  (Theo.  37.02%  C,  2.76%  H 
54.74%  Br). 

The  dehydrobromi nation  of  XVII  to  form  5-bromo-2-phenylpropenal  (XIX)  was 
attempted  using  a  variety  of  bases  such  as  hot,  solid  K2CD3,  aqueous  KOH  (with 
phase  transfer  catalyst)  and  DBU,  This  latter  organic  base  was  used  successfully 
by  Bailey  and  co-worxers  (10)  to  perform  a  oehydrobromination  in  the  synthesis  of 
c<,£-unsaturated  esters.  Only  one  attempt  has  been  made  at  this  type  of 
oenydrobrominetioh.  The  current  material  in  hand  has  not  been  purify&iin  any  wa.\ . 

A  p'^cn  NMR  spectrum  is  shown  in  Figure  II.  Further  work  is  planned  to  purify 
this  mate’"ia'  by  column  chrometograpny  anc  cc  repeat  the  synthesis  on  a  larger 
stale- 

V.  CONCLUSIONS 

Two  routes  to  the  synthesis  of  the  diene-yne.,  IV.,  have  been  develooed.  The 
one-step  route  using  a  Wittig  reaction  on  atropalognyce  gives  the  ossirec  proauct 
in  greater  overall  yield  and  in  purer  form.  Although  mass  spectral  data  and 
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infrared  spectra  indicate  the  presence  of  tne  diene-yne,  tne  proton  and  NMR 
spectra  do  not  indicate  the  presence  of  olefinic  or  acetylenic  hydrogens.  This 
synthesis  should  be  repeated  on  a  larger  scale  in  order  to  develop  methods  for 
purification  and  to  more  fully  characterize  the  material. 

Careful  DSC  studies  should  be  performed  on  this  compound  in  order  to  determine 
the  region  (temperature)  of  cyclizatior.  and  the  extent  of  formtion  of  m-terphenyl . 

As  an  intermediate  needed  in  the  synthesis  of  diene-yne  VI, 
2,3-dibromo-2-phenylpropanal  could  be  prepared  in  good  yield  from  atropaldehyde . 

The  dehydrobromination  of  this  compound  using  DBU  appears  to  be  successful  although 
purification  procedures  need  to  be  conductec. 
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ABSTRACT 

A  device  was  designed  and  constructed  to  generate  leaky 
guided  waves  in  the  submerged  composite  materials,  addressing 
some  of  the  problems  associated  with  the  existing  appartus. 
Particular  featu..e  of  the  design  is  that  at  different  orientation 
of  the  transducers  the  point  of  insonif ication  remains  the  same. 
The  measurements  were  made  with  both  the  leaky  surface  and  plate 
waves , 

Experimental  measurements  of  leaky  Rayleigh  waves  on  a  fluid- 
composite  solid  interface  are  presented,  and  the  results  are 
compared  with  those  of  a  mixture-continuum  model.  A  laminated 
microstructure  is  simulated  by  fabricating  a  solid  from  a  stack 
of  alternating  layers  of  stainless  steel  and  copper  sheets. 
Leaky  Rayleigh  waves  are  propagated  in  the  lamination  direction 
with  their  particle  motion  in  the  plane  of  the  layers.  At  the 
intermediate  frequencies  the  Rayleigh  waves  are  observed  to  be 
dispersive,  and  the  frequency  dependence  is  related  to  the  ratio 
of  wave  length  and  layer  thickness  (cell  size).  For  long 
wavelengths  the  surface  wavespeed  attains  the  mixture  value  for 
the  two  constituents.  For  short  wavelengths,  the  Rayleigh  wave 
velocity  asymptotically  approaches  the  values  for  the  constituent 
having  the  slower  surface  wave  velocity  in  the  absence  of 
microstructure . 
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INTRODUCTION 


The  research  undertaken  here  was  continuation  oE  the  author's 
work  at  the  materials  laboratory  (WPAT/AFB) ,  which  he  visited 
during  the  summer  of  1986  and  subsequently  in  1987.  The 
principal  interest  was  in  launching  and  receiving  the  leaky 
guided  waves  in  unidirectional  fibrous  composites.  One  of  the 
problems  encountered  in  the  measurement  system  was  that  as  the 
transducer  orientation  changed  the  point  of  insonif ication 
changed  also.  The  transducer  then  has  to  be  translated  to  reach 
to  the  original  point.  A  simple  device,  though  similar  to 
goniometer,  was  devised  to  circumvent  the  above  problem.  The 
interest  then  was  to  study  the  off-angle  propagation  of  Lamb 
waves  in  the  unidirectional  fibrous  composites.  Later  this  work 
was  discontinued,  under  the  instruction  from  the  materials 
laboratory,  for  the  surface  wave  studies. 

The  Rayleigh  waves  propagated  in  fibrous  medium  exhibit 
dispersion  similar  to  the  bulk  waves.  To  aid  in  the  studies  of 
such  composites,  a  simpler  two  dimensional  model  was  suggested 
composed  of  layers  of  two  different  materials.  Thus  specimens 
exhibiting  laminar  microstructure  were  constructed  and  the 
dispersion  studies  were  carried  out. 

OBJECTIVE  OF  THE  RESEARCH  EFFORT; 

The  overall  objective  of  this  particular  program  is  to 
conduct  basic  research  in  acoustics  and  follow  it  up  with  the 
necessary  technological  developments  for  the  advancement  of  NDE 
and  imaging  techniques  to  meet  the  nations's  Air  and  Strategic 
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defense  needs 


Our  particular  objective  was  to  further  the  measurement 
technique  for  leaky  Raleigh  waves  in  dispersive  media.  The 
specific  problems  addressed  were: 

1.  Design  and  construction  of  a  device  to  launch  and  receive 
leaky  waves  without  affecting  the  change  of  measurement 
location,  a  handicap  of  existing  devices. 

2.  The  construction  of  specimen  exhibiting  laminar 
microstructure,  and  measure  Rayleigh  wave  dispersion. 

INTRODUCTION  TO  THE  TOPIC  AND  APPROACH 

The  measurement  of  Rayleigh  wave  velocity  in  dispersive  media 
presents  a  challenge  since  the  conventional  technique  of 
measuring  time  of  flight  with  wedge  transducers  would  require  a 
different  transducer  for  each  frequency  of  the  measurement.  Even 
more  troublesome  is  the  distortion  of  the  short  tonebursts  used. 
The  dispersive  media  clutters  the  echo,  making  it  difficult  to 
measure  the  travel  time  with  accuracy,-  the  same  difficulty  is 
encountered  in  measuring  dispersion  of  the  bulk  waves.  A  more 
efficient  scheme  employed  by  Chimenti  [4,5]  consists  of  resonant 
generation  of  leaky  Rayleigh  (and  Lamb)  waves  by  immersed 
transducers  launching  acoustic  waves  at  critical  incident  angles 
to  the  liquid-solid  interface.  If  conditions  of  frequency, 
beamwidth,  and  radiation  damping  coefficient  are  favorable,  a 
bimodal  reflected  acoustic  field  is  created,  composed  of  a 
remnant  specular  reflection  summed  coherently  with  the  radiating 
surface  wave.  In  that  case,  a  portion  of  the  field  distribution 


81-4 


shows  a  strong  reduction  in  amplitude/  due  to  phase  cancellation 
of  component  fields.  The  technique  of  acoustically  seeking  a 
null  zone  in  the  reflected  field,  caused  by  phase  cancellation 
between  leaky  surface  energy  and  the  specular  reflection  provides 
a  convenient  and  clear  means  of  establishing  the  presence  of  a 
surface  wave  at  the  appropriate  frequency. 

A  new  fixture  was  designed  in  which  the  transducer  holder 
slides  in  a  slotted  arc  such  that  the  center  of  arc  could  be 
maintained  on  the  surface  of  the  transducer.  In  this  case,  at 
different  orientation  the  point  of  insoni f ication  remains  the 
same,  considerably  easing  the  measurements  compared  to  the 
earlier  set  up.  The  details  of  the  designs  are  described  in  the 
next  section. 

Various  mixture-continua  theories  have  been  proposed  to  model 
elastodynamics  of  composite  materials.  Experimental  support  is 
needed  both  to  aid  in  formulating  as  well  as  in  verifying  these 
models.  It  requires  devising  experiments  and  constructing 
practical  specimens  that  would  simulate  the  desired  material 
characteristics.  In  what  follows,  a  one-dimensional  laminar 
microstructure  is  simulated  by  constructing  samples  with 
alternating  layers  of  copper  and  stainless  steel  sheets.  The 
dispersion  of  leaky  Rayleigh  waves  is  studied  on  such  a  sample. 

DESIGN  OF  FIXTURE 

The  fixture  consists  of  two  aluminum  support  plates  mounted 
on  a  channel  across  the  Lank  as  shown  in  Fig.  1.  One  of  the 
plates  is  allowed  to  slide  along  the  length  of  the  channel,  while 
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the  other  is  held  stationary.  The  channel  is  leveled  with 
respect  to  the  base  plate  of  the  tank  to  obtain  proper  alignment. 
The  transducer  assemblies  are  attached  to  the  support  plate  at  a 
curved  slot  which  allows  the  assembly  to  rotate.  Both  the 
transmitting  and  receiving  transducers  are  held  by  means  of  a 
tube  which  also  serves  as  a  conduit  for  the  transducer  cables. 
An  0-ring  between  the  tube  and  the  transducer  keeps  the  water 
from  leaking  into  the  tube.  The  receiving  transducer  is  attached 
to  the  sliding  support  plate,  and  the  transmitting  transducer  to 
the  stationary  one. 

Motion  of  the  sliding  support  plate  is  provided  by  a  screw 
and  nut  attachment  housed  within  the  channel.  Hence,  the  sliding 
plate  with  the  receiving  transducer  effectively  provides  the 
horizontal  movement  required  to  locate  the  null  energy  region  of 
the  reflected  field.  Variable  angle  settings  can  be  selected  for 
each  of  the  transducers  by  means  of  a  rack  and  gear  mechanism 
located  in  the  curved  slots  of  the  support  plates.  The  reference 
point  for  the  measurement  of  the  angle  is  the  vertical  axis,  thus 
the  point  of  insonif ication  remains  the  same  for  all  transducer 
positions.  The  curved  rack  is  positioned  on  the  back  of  each 
support  plate  where  it  is  engaged  with  a  gear  wheel.  By  moving 
the  gear  wheel  along  the  rack,  the  desired  angle  settings  for  the 
transducer  can  be  achieved.  The  range  of  angles  that  can  be 
attained  with  this  arrangement  is  from  5  to  65  degrees  with 
respect  to  the  vertical  axis.  The  angle  settings  are  accurately 
marked  on  the  front  of  the  support  plate.  Therefore,  this 
arrangement  selects  different  angle  settings  precisely  and 
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quickly.  All  the  components  of  this  fixture  are  anodized  for 
corrosion-resistance . 

The  forerunner  of  this  fixture  had  a  thinner  support  plate 
and  did  not  have  the  rack  and  gear  mechanism  for  selection  of 
different  angles.  It  also  did  not  have  the  nut  and  screw 
arrangement  for  the  sliding  motion  of  the  support  plate. 
Selection  of  different  angle  settings  were  not  as  quick,  and 
there  was  always  a  chance  of  committing  errors  with  a  protractor. 

A  plexiglass  base  plate  equipped  with  leveling  screws  is 
placed  at  the  bottom  of  the  tank.  The  specimen  is  then  placed  on 
the  top  of  the  base  plate. 

SAMPLE  PREPARATION 

At  first,  we  considered  fabricating  the  specimen  out  of  the 
layers  of  urethane  and  epoxy.  However,  problems  of  accurately 
determining  exact  elastic  properties  of  cured  epoxy  and  the 
potential  problem  of  the  poor  quality  of  the  interfaces  suggested 
using  all-metal  constituents.  The  best  choice,  considering  their 
ready  availability,  was  sheets  of  copper  and  stainless  steel.  A 
diffusion  bond  would  have  been  an  ideal  choice  for  forming  a 
solid  out  of  these  sheets,  but  the  idea  was  found  co  be 
impractical.  Instead,  we  constructed  a  clamping  fixture  to  keep 
the  plates  pressed  together,  assuring  uniform  contact  that  would 
realize  the  welded-  contact  boundary  condition  for  both  the 
stresses  and  the  displacements. 

The  sheets  were  degreased  and  etched.  After  clamping  the 
plates  tightly,  the  assembly  was  heated  in  an  oven  for  two  hours 
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at  250°  centigrade.  Considerable  care  in  machining  was  taken  to 
prepare  the  top  surface  without  deforming  the  cells  and  to 
achieve  a  mirror  finish.  Fig.  2  shows  the  photograph  of  one  of 
these  samples,  where  the  actual  cell  size  is  0.787  mm.  The  sharp 
interfaces  between  the  two  constituents  and  the  high  degree  of 
regularity  in  the  laminates  are  clearly  seen  in  this  figure. 

MEASUREMENT  DESCRIPTION 

A  schematic  of  the  experimental  geometry  is  given  in  Fig.  3, 
An  ultrasonic  beam  from  a  damped  piston  source  insonifies  the 
sample  at  selected  incident  angles,  each  of  which  corresponds  (by 
Snell's  law)  to  a  particular  trace  velocity.  At  frequencies 
corresponding  to  the  generation  of  Rayleigh  waves  in  the  sample, 
the  reflected  ultrasonic  field  will  show  the  displacement  and 
distortion  characteristic  of  the  presence  of  a  propagating  wave 
on  the  surface.  At  frequencies  away  from  critical,  the  signal  is 
dominated  by  the  specular  reflection.  An  initial  set  of  measure¬ 
ments  was  made  on  a  pulsed  Schlieren  system  both  to  build 
confidence  in  our  ability  to  launch  surface  waves  on  such  a 
sample  and  to  verify  the  presence  of  dispersion.  Fig.  4  shows 
the  Schlieren  visualization,  which  also  provides  a  vivid  picture 
of  the  null  zone  described  earlier.  To  study  surface  wave 
dispersion,  data  are  acquired  in  a  high-resolution  ultrasonic 
scanning  system  by  exciting  a  broadband  ultrasonic  transducer 
with  low-level  rf  tonebursts  from  a  function  generator.  The 
reflected  acoustic  beam  is  detected  by  a  second  matched 
transducer  positioned  at  the  same  angle  and  about  the  same 
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distance  to  the  sample.  Spectra  are  accumulated  by  discretely 
stepping  the  frequency  and  recording  the  reflected  signal.  A 
typical  experimental  spectrum  for  an  incident  angle  of  43°  is 
shown  in  Fig.  5.  The  dashed  curve  is  the  unprocessed  experi¬ 
mental  signal,  while  the  solid  line  shows  the  same  data 
normalized  to  remove  the  transducer  response.  At  9.35  MHz,  there 
is  a  strong  reduction  in  amplitude,  indicating  the  presence  of  a 
leaky  Rayleigh  wave.  The  test  is  repeated  over  a  range  of 
transducer  angles,  and  the  corresponding  Rayleigh  wave  frequency 
is  determined. 

RESULTS  AND  DISCUSSION 

The  new  design  of  transducer  holder  performed  satisfactorily 
and  provided  ease  in  the  measurements  as  expected.  However  there 
is  room  for  improvement;  the  gear  retards  free  motion,  for  finer 
adjustment . 

The  results  of  experimental  measurements  n  i  ultrasonic 
reflection  from  the  laminated  sample  and  the  r.. suits  of  the 
theoretical  calculations  showing  the  phase  velocity  dispersion 
versus  frequency  times  cell  size  are  displayed  in  Fig.  6  (the 
elastic  properties  used  in  the  calculations  are  collected  in 
[10]).  Open  circles  have  been  recorded  with  the  2.0-MHz 
transducers,  while  the  points  denoted  by  the  crosses  correspond 
to  the  10.0-MHz  pair.  The  solid  curve  is  the  prediction  of  the 
approximate  theory.  At  the  low  frequency  end  of  the  curve  the 
normalized  wavelength  (^/d)  is  equal  to  3.4,  with  a  value  of  0.20 
at  the  high  frequency  end.  That  is,  the  data  and  theory  span 
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more  than  one  order  of  magnitude.  In  spite  of  this  large 
frequency  range,  over  which  the  dispersion  changes  markedly  in 
terms  of  the  mechanical  behavior,  rather  good  agreement  is  seen 
across  the  Fd  axis.  In  particular,  the  asymptotic  approach  at 
large  Fd  to  the  Rayleigh  velocity  of  the  more  compliant  material 
(in  this  case  copper)  is  very  accurately  rendered  by  the 
prediction.  Also,  the  wavespeed  derived  from  the  mixture  elastic 
constants  is  well  reproduced  at  low  freq'ioncy,  where  the 
influence  of  the  microstruccure  on  dispersion  is  diminishing. 

In  summary,  we  have  measured  dispersion  of  leaky  Rayleigh 
waves,  where  the  dispersive  behavior  arises  soley  from 
microstructural  features  of  solid.  We  find  the  limiting  behavior 
at  long  wavelengths  plausible. 

CONCLUDING  REMARKS 

The  measurement  of  dispersion  is  difficult,  even  for  the  bulk 
waves.  The  method  for  measurements  of  Rayleigh  wave  dispersion 
as  presented  here  is  quite  efficient.  However  analytical  results 
for  comparisons  are  lacking.  The  results  we  have  compared  with 
are  computed  from  mixture-continuum  theories.  Ideally  an 
independent  verification  is  necessary  for  the  experimental 
results  before  they  can  be  used  to  verify  the  computational 
models.  Some  preliminary  studies  made  by  the  investigator,  both 
for  longitudinal  and  shear  waves,  show  infinite  dispersive  modes 
exist,  similar  in  nature  to  Lamb  waves.  It  may  be  surmised  that 
the  surface  waves  in  such  layered  material  could  also  have 
several  modes,  possibly  with  regions  of  frequencies  for  which 
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real  velocities  do  not  exist 


Without  guidance  from  analysis 


some  of  the  experimental  data  can  be  confusing.  We  are  beginning 
this  analysis  by  studying  the  wave  reflections  from  a  free 
surface  in  such  materials. 
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gure  1.  Transducer  Fixture. 


Figure  3.  Schematic  of  Leaky-Rayleigh  Wave  experiment.  Transmitter  is  fixed, 
while  receiver  scans  along  x-axis,  which  is  coll  inear  with  the 
fluid  interface.  Dashed  lines  are  supressed  specular  reflection. 
Shaded  region  contains  most  of  the  power. 
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Figure  4.  Pulsed  Schlieren  Visualization  of  leaky  Rayleigh 

Wave  phenomena  in  a  copper-stainless  steel  laminate. 
Incident  tone-burst  is  at  right,  reflects  from  the 
surface,  and  appears  distorted  and  displaced  at  the 
left  of  the  frame.  Null  zone  in  reflected  field  is 
clearly  visable. 
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FIG.  5.  Experimental  Reflected  Wave  Signal  versus  Frequency x  Cell  size. 
Amplitude  units  are  arbitrary 
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ABSTRACT 


"Performance  Improvement  in  Knowledge-Based  Control  Systems" 
is  sponsored  by  the  Air  Force  Office  of  Scientific  Services  under 
Contract  No.  F49620-85-C-0013/SB5851-0360,  and  administered  under 
Universal  Energy  Systems  Project  760.  This  research  investigates 
improving  system  performance  by  using  deterministic  knowledge  to 
enhance  resolution  of  conflicts  within  the  program.  Different 
heuristic  and  numeric  strategies  of  conflict  resolution  are 
presented.  Each  strategy  is  reviewed  from  the  viewpoint  of  its 
usefulness  within  the  process  control  environment.  It  is 
determined  that  use  of  deterministic  knowledge  is  inappropriate 
for  conflict  resolution.  However,  alternative  uses  of  this 
knowledge  are  presented. 
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FOREWORD 


This  report  describes  work  completed  under  Contract  Number 
F49620-85-C-0013/SB5851-0360  during  the  period  from  January  1  to 
December  31,  1988.  This  effort  investigates  the  potential  for 
using  deterministic  knowledge  to  enhance  conflict  resolution 
within  knowledge-based  process  control  systems. 

The  project  is  sponsored  by  the  Air  Force  Office  of 
Scientific  Services,  Bolling  AFB,  DC,  administered  by  Universal 
Energy  Systems,  Dayton,  Ohio,  and  conducted  by  researchers  in  the 
Department  of  Industrial  Engineering  at  Louisiana  State 
University  in  Baton  Rouge,  Louisiana.  Major  Steven  LeClair  of  the 
AFWAL  Materials  Laboratory  at  Dayton,  Ohio  is  the  principal  Air 
Force  representative,  and  Mr.  John  M.  Usher  and  Dr.  Gerald  R. 
Graves  are  the  principal  Louisiana  State  University  researchers. 

This  report  is  divided  into  seven  chapters.  The  first 
chapter  introduces  the  concept  of  conflict  resolution  and 
identifies  the  major  classes  of  resolution  strategies.  The  next 
two  chapters  then  present  knowledge-based  control  and  discuss  the 
conflict  resolution  strategies  from  this  viewpoint.  In  the 
remaining  chapters,  qualitative  process  automation  is  examined, 
and  its  method  of  conflict  resolution  is  presented.  Methods  of 
improvement  are  discussed  and  conclusions  provided. 
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CHAPTER  1 


INTRODUCTION 


This  research  report  addresses  the  issue  of  performance 
improvement  in  real-time  knowledge-based  control  systems.  The 
objective  of  this  effort  was  to  enhance  the  technique  for 
resolution  of  conflicts  within  the  knowledge-based  system.  The 
targeted  system  for  applying  these  improvements  is  the 
Qualitative  Process  Automation  (QPA)  controller  developed  and 
patented  by  Air  Force  researchers. 

The  proposed  method  of  conflict  resolution  involved  using 
deterministic  knowledge  to  aid  in  the  resolution  process.  The 
deterministic  knowledge  would  be  in  the  form  of  equations  that 
could  be  solved  by  numerical  methods.  The  results  of  the  study 
found  that  it  is  not  beneficial  to  use  deterministic  knowledge  in 
conflict  resolution.  However,  it  could  be  used  as  one  of  the 
principle  sources  of  object-level  knowledge.  In  addition,  it  is 
proposed  that  QPA  needs  additional  conflict  resolution  strategies 
in  order  to  expand  its  usefulness. 

This  report  is  divided  into  seven  chapters  beginning  with 
this  introduction.  Chapter  2  presents  a  definition  of  conflict 
resolution  followed  by  a  review  of  the  different  conflict 
resolution  strategies  now  available.  The  next  chapter.  Chapter  3, 
introduces  the  knowledge-based  process  control  system  with  a 
discussion  of  its  functions,  capabilities,  and  operational 
requirements.  This  is  followed  by  Chapter  4  which  addresses 
conflict  resolution  from  the  point  of  view  of  the  process  control 
environment.  Chapter  5  then  presents  the  theory  of  Qualitative 
Process  Automation  and  discusses  the  operation  of  the  QPA 
controller  as  applied  to  the  process  of  laminating  composites  in 
an  autoclave.  This  chapter  concludes  with  a  review  of  the  current 
methods  of  conflict  resolution  used  by  QPA.  Chapter  6  reviews  the 
proposed  methods  of  enhancing  conflict  resolution  and  discusses 
the  problems  with  these  methods.  The  last  chapter.  Chapter  7, 
presents  concluding  remarks  and  discusses  future  work. 
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CHAPTER  2 


CONFLICT  RESOLUTION 


2 . 1  Introduction 


In  order  to  discuss  the  techniques  of  conflict  resolution, 
it  is  necessary  to  understand  the  basic  operation  of  a  knowledge- 
based  system.  Knowledge-based  systems  can  vary  dependent  on  the 
type  of  control  strategy  applied  (forward  chaining,  backward 
chaining,  etc.)  and  the  method  used  to  represent  the  knowledge 
(frames,  semantic  nets,  production  rules,  etc.).  Since  production 
systems  are  a  common  type  of  knowledge-based  system,  the 
following  discussion  will  be  based  on  the  operational 
characteristics  of  these  sysitems,  but  the  concepts  extend  to  all 
other  types  of  knowledge-based  systems. 

A  production  system  operates  in  cycles  made  up  of  the  two 
operational  phases:  "recognize”  and  "act."  The  recognize  phase 
can  be  further  divided  into  two  major  steps,  "match"  and  conflict 
resolution  (see  Figure  2.1).  The  "match"  step  employs  some  form 
of  a  control  strategy  (within  the  inference  engine)  to  compare 
the  contents  of  working  memory  with  the  rules  in  the  knowledge¬ 
base.  The  objective  of  this  step  is  to  determine  which  rules  are 
satisfied  by  the  facts  in  working  memory.  When  all  the  conditions 
of  a  rule  are  matched  by  data  elements  within  working  memory,  the 
rule  is  instantiated.  Since  it  is  possible  for  the  system  to 
instantiate  more  than  one  rule  in  a  single  cycle,  all  the 
instantiated  rules  are  placed  in  a  set,  called  the  "conflict 
set."  In  the  next  step,  conflict  resolution,  the  system  will 
order  the  alternative  instantiations  within  the  conflict  set  and 
then  select  one  or  more  of  the  instantiations  to  execute.  The 
execution  of  the  rule  actions  (consequents)  is  the  function  of 
the  "act"  phase  of  the  operational  cycle. 

The  objective  of  the  conflict  resolution  step  is  to  apply 
some  specific  strategy  to  differentiate,  and  therefore  order  the 
instantiations  within  the  conflict  set.  The  need  for  a  conflict 
resolution  strategy  arises  only  in  those  systems  where  multiple 
solutions  are  possible.  It  is  possible  to  construct  a  knowledge 
base  such  that  multiple  solutions  will  not  occur  in  operation, 
this  occurs  when  all  the  knowledge  elements  are  applicable  to 
mutually  exclusive  situations.  Construction  of  such  a  knowledge 
base  is  a  difficult  task  and  reserved  for  use  on  knowledge  bases 
of  small  size,  or  knowledge-  bases  which  use  a  highly-structured 
knowledge  representation  (e.g.,  semantic  nets)  where 
relationships  between  objects  are  more  easily  detected. 

Sometimes  even  though  there  are  several  solutions  possible, 
only  one  solution  is  acceptable.  As  an  example,  consider  the  two 
expert  systems  MYCIN  and  Rl.  MYCIN  [1]  is  an  expert  system  that 
diagnoses  infectious  diseases;  therefore,  it  is  important  that 
the  system  consider  all  possible  diseases  in  order  to  determine 
the  patient's  true  disease.  Then  doctors  can  properly  treat  the 
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Conflict  Resolution 


Ordering 
If  C  then  G 
If  A  then  E 
If  D  then  H 


Figure  2.1:  Production  System  Cycle. 


patient.  However,  some  systems  do  not  need  to  consider  all  the 
possible  solutions,  even  though  they  may  exist.  An  example  would 
be  the  expert  system,  R1  [2],  that  configures  computer  systems 
from  customer  orders.  Any  configuration  that  Rl  determines,  which 
meets  the  constraints,  is  acceptable.  Therefore,  conflict 
resolution  is  not  required  for  all  systems  and  is  dependent  on 
the  application  involved. 

In  the  literature,  the  discussion  of  conflict  resolution  is 
limited  to  knowledge-based  systems  which  use  knowledge  in  the 
form  of  production  rules  (IF-THEN  form) .  Production  rules  is  one 
of  the  most  widely  used  methods  of  knowledge  representations. 
This  representation  lacks  structure  and  as  the  size  of  the 
knowledge  base  grows  so  does  the  chance  of  the  system  arriving  at 
conflicting  alternatives.  It  is  for  this  purpose  that  researchers 
developed  the  strategies  of  conflict  resolution.  In  this  chapter. 
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these  different  strategies  for  conflict  resolution  and  the  logic 
behind  their  use  will  be  presented. 


2.2  Literature  Review 

Conflict  resolution  has  been  of  interest  from  the  beginning 
of  the  development  of  knowledge-based  systems.  Just  as  humans 
arrive  at  several  alternatives  when  reasoning  about  a  solution, 
so  will  most  knowledge-based  systems.  This  results  in  a  need  to 
resolve  the  conflict  between  the  alternative  solutions  and  make  a 
selection  of  the  best  solution  based  on  some  strategy.  The 
following  is  a  dJ.scussion  of  the  conflict  resolution  strategies 
presented  in  the  literature.  These  strategies  are  categorized  as 
heuristic  techniques,  or  numeric  techniques.  These  techniques  of 
conflict  resolution  represent  the  basic  methods  in  use  today. 


2.2.1  Heuristic  Techniques 

One  of  the  fundamental  works  on  conflict  resolution  is  that 
of  McDermott  and  Forgy  [3].  In  this  paper,  McDermott  and  Forgy 
explore  the  requirements  of  conflict  resolution  strategies  for 
production  systems  used  within  a  dynamic  environment.  The 
important  characteristics  of  the  production  system  are  its 
sensitivity  to  changes  in  the  environment,  and  its  stability  in 
the  face  of  these  changes.  A  system's  sensitivity  is 
characterized  by  its  ability  to  respond  quickly  to  changes  which 
occur  in  the  environment.  The  system's  stability  is  determined  by 
how  well  it  maintains  continuity  in  its  behavior.  McDermott  and 
Forgy  state  that  "the  function  of  the  conflict  resolution 
strategy  is  to  provide  a  mechanism  that  can  preserve  sensitivity 
and  stability  without  sacrificing  production  autonomy"  [3]. 

McDermott  and  Forgy  present  five  classes  of  strategies  for 
conflict  resolution.  Each  class  of  rules  is  characterized  by  the 
criteria  it  uses  to  judge  the  worth  of  an  instantiation  (see 
Table  2.1).  Within  each  class,  a  variety  of  individual  rules  for 
resolving  conflicts  can  be  developed.  These  rules  can  be 
distinguished  by  their  selectivity,  and  the  source  of  knowledge 
they  use  to  make  their  decision. 

The  selectivity  of  a  rule  is  defined  by  its  capability  to 
differentiate  the  worth  of  each  member  within  the  conflict  set. 
If  a  rule  is  capable  of  distinguishing  between  the  worth  of  each 
member  of  the  conflict  set  (no  two  are  of  equal  worth)  ,  then  it 
is  possible  for  the  system  to  order  the  alternatives  and  select 
the  best.  Such  a  rule  would  be  considered  strongly  selective.  As 
a  conflict  resolution  rule  decreases  in  selectivity,  it 
capability  to  distinguish  between  the  alternative's  worth  is 
decreased.  This  results  in  a  greater  chance  that  two  or  more 
alternatives  will  be  viewed  as  having  equal  value  as  a  solution. 
In  order  to  make  a  selection,  another  rule  will  have  to  be 
applied  to  provide  further  clarification. 
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Table  2.1:  Classes  of  Conflict  Resolution  Strategies. 


CLASSES  OF  CONFLICT  RESOLUTION  STRATEGIES 

Production  Order  Rules 
Special  Case  Rules 
Recency  Rules 
Distinctiveness  Rules 
Arbitrary  Decision  Rules 


The  possible  knowledge  sources  that  can  be  used  to  make  a 
decision  include  the  knowledge-base,  working  memory,  and  the 
inference  engine's  (control  program's)  state  memory.  The 
knowledge  base  provides  information  about  the  order  in  which  the 
rules  were  added,  or  organized.  Working  memory  provides 
information  on  the  ages  of  the  facts  it  contains,  and  the  order 
those  facts  were  received.  The  third  source,  state  memory,  can 
hold  information  about  the  decisions  made  in  prior  cycles. 

The  five  classes  of  conflict  resolution  strategies,  listed 
in  Table  2.1,  encompass  a  majority  of  the  resolution  techniques 
currently  in  use.  In  the  following  discussion  the  basic  strategy 
of  each  of  the  five  classes  of  rules  is  presented.  For  examples 
of  some  possible  rules  that  can  be  developed  within  each  class, 
the  reader  is  referred  to  [3]. 

The  first  ciass  of  rules,  referred  to  as  "production  order 
rules",  uses  the  ordering  of  the  rules  within  the  knowledge-base 
as  the  criteria  from  which  to  make  its  selection.  The  order 
criteria  can  be  based  on  the  age  of  a  rule  or  possibly  its 
physical  location  within  the  knowledge  base.  This  method  requires 
that  the  user  develop  a  scheme  for  classifying  and  correctly 
placing  each  rule  within  the  knowledge-base.  This  complicates 
maintenance  of  the  knowledge  base. 

The  second  class  of  rules,  "special  case  rules",  favors  an 
instantiation  which  is  a  special  case  of  another  instantiation  in 
the  conflict  set.  The  definition  of  a  "special  case"  is  left  to 
the  user  that  develops  the  explicit  rule.  As  an  example,  consider 
a  resolution  rule  that  states: 

A  rule,  Rl,  is  a  special  case  of  rule,  R2,  if  rule  R1 

has  a  greater  number  of  preconditions  than  rule  R2. 

Then  if  the  two  rules, 

Rule-1 :  IF  (A)  AND  (B)  AND  (C)  THEN  (D) 

Rule-2:  IF  (A)  AND  (B)  THEN  (F) 
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are  members  of  the  conflict  set,  this  strategy  would  favor  Rule-1 
over  Rule-2,  since  Rule-1  has  three  preconditions,  while  Rule-2 
only  has  two.  An  example  of  a  related  rule  that  is  less  selective 
is: 


A  rule,  Rl,  is  a  special  case  of  rule,  R2,  if  the 
preconditions  of  R2  match  a  subset  of  the  preconditions 
of  Rl. 

Using  the  two  example  rules  above,  this  strategy  would  again 
select  Rule-1  over  Rule-2.  All  the  preconditions  of  Rule-2  appear 
in  Rule-1,  therefore,  Rule-1  can  be  considered  a  "special  case" 
of  Rule-2.  Following  the  same  basic  strategy  of  this  class,  other 
variations  of  these  rules  are  possible.  This  strategy  class  is 
considered  weakly  selective,  in  that  it  only  eliminates  those 
instantiations  within  the  conflict  set  that  are  general  cases  of 
other  instantiation  in  the  conflict  set. 

A  third  strategy  for  conflict  resolution,  called  "recency 
rules",  bases  the  ordering  of  the  alternative  instantiations  on 
the  amount  of  time  that  the  data  elements  of  each  instantiation 
have  resided  within  working  memory.  Dependent  on  the  application, 
a  strategy  rule  could  be  developed  that  favors  instantiations 
using  old  elements,  or  one  that  favors  instantiations  using  the 
newer  elements  in  working  memory.  The  method  used  to  measure  the 
age  of  the  data  element  also  affects  the  ordering  of  the 
instantiations . 

The  fourth  strategy,  "distinctiveness  rules",  makes  a 
selection  based  on  the  similarity  or  dissimilarity  of  the 
instantiations  in  the  conflict  set  to  those  instantiations  that 
the  system  selected  in  previous  cycles.  This  strategy  can  be  used 
to  prevent  any  production  from  firing  on  consecutive  cycles. 

The  final  strategy  class  contains  "arbitrary  selection 
rules".  In  this  class,  the  system  performs  the  ordering  of  the 
alternatives  using  some  method  of  random  selection.  This  class  of 
rules  demonstrates  strong  selectivity  since  random  selection 
guarantees  that  the  system  will  be  able  to  distinguish  between 
the  alternatives.  Such  a  strategy  should  only  be  applied  in  cases 
where  no  other  strategy  can  provide  any  further  differentiation 
between  the  members  of  the  conflict  set.  This  strategy  assumes 
that  each  alternative  is  approximately  equivalent  in  its  worth  to 
the  system;  therefore,  random  selection  of  one  will  not  cause  a 
problem. 

McDermott  and  Forgy  [3]  conclude  that  no  simple  conflict- 
resolution  strategy  can  be  completely  satisfactory.  They  conclude 
that  no  single  conflict  resolution  strategy  is  a  strong  supporter 
of  both  sensitivity  and  stability.  Thus  a  combination  of 
strategies  is  required  to  develop  a  sound  technique  for 
resolution  of  conflicts. 

Barr  and  Feigenbaum  [4]  also  discuss  conflict  resolution  in 
production  systems.  They  identify  the  conflict  resolution  phase 
as  the  time  in  the  cycle  of  the  system  when  "basic  cognitive 
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traits  like  action  secpiencing,  attention  focusing, 
interruptibility ,  and  control  of  instability  are  realized". 
Therefore,  because  of  the  capability  of  the  conflict  resolution 
strategy  to  control  these  traits,  the  knowledge  base  maintains  a 
modularity  that  contributes  to  the  stability  of  the  system.  They 
list  seven  common  strategies  used  for  conflict  resolution.  In  the 
discussion  to  follow,  each  of  these  strategies  is  presented  and 
related  back  to  the  five  main  classes  of  strategies  proposed  by 
McDermott  and  Forgy  to  demonstrate  the  comprehensive  nature  of 
these  classes. 

The  first  and  simplest  of  the  strategies  instructs  the 
system  to  use  the  first  rule  it  instantiates.  Therefore,  the 
reasoning  cycle  terminates  at  the  first  instantiation  of  a  rule. 
One  could  argue  that  this  method  is  not  a  resolution  strategy 
since  it  does  not  formulate  a  conflict  set  or  deal  with  more  than 
one  alternative  (no  conflict  ever  arises) .  In  such  a  system,  the 
order  in  which  the  rules  are  placed  in  the  knowledge  base  becomes 
the  ruling  factor  in  their  worth.  This  strategy  falls  within  the 
"production  order"  class  of  conflict  resolution  rules. 

A  second  method  involves  selecting  the  rule  from  the 
conflict  set  which  has  the  highest  priority.  This  strategy 
requires  the  assignment  of  priority  by  the  programmer  according 
to  the  demands  and  characteristics  of  the  task.  The  weakness  of 
this  method  arises  in  the  method  used  to  determine  the  weights  of 
each  rule.  This  method  can  be  considered  to  belong  to  the 
"production  order"  strategy  class,  in  that  the  assignment  of 
priority  values  to  the  contents  of  the  knowledge  base  requires 
the  same  logic  as  ordering  the  knowledge  base  in  terms  of 
importance. 

The  third  method  listed  belongs  to  the  class  of  rules 
labeled  "special  case"  rules.  This  strategy  of  resolving 
conflicts  selects  the  rule  that  is  the  "most  specific"  of  the 
rules  in  the  conflict  set.  The  term  "most  specific"  means  the 
rule  which  has  the  most  detailed  pre-conditions.  This  logic 
assumes  that  the  more  conditions  satisfied  by  rule,  the  better  it 
characterizes  the  current  state  of  the  system. 

The  fourth  method  for  conflict  resolution  involves  selecting 
the  rule  that  refers  to  some  fact  most  recently  added  to  the 
contents  of  working  memory,  a  "recency"  class  rule.  This  strategy 
has  the  effect  of  focusing  the  system.  For  example,  suppose  that 
on  the  first  cycle  two  rules  (Rl  arid  R2)  were  instantiated  and 
the  system  selected  rule  Rl  to  fire.  In  the  next  cycle,  if  the 
system  instantiates  rules  Rl,  R2,  and  R3,  then  the  resolution 
strategy  would  select  R3  since  this  is  evidently  the  action  to 
finish  what  Rl  started.  This  strategy  has  the  effect  of  focusing 
the  system  on  the  results  of  the  actions  most  recently  taken. 

The  fifth  approach  involves  selecting  the  rule  in  the  conflict 
set  which  has  not  previously  been  instantiated  by  the  system. 
This  allows  the  system  to  give  immediate  attention  to  events  that 
have  not  previously  occurred.  This  strategy  belongs  to  the 
"distinctiveness"  class  of  rules.  The  next  approach  to  conflict 
resolution  is  identical  to  the  "arbitrary  selection  rules"  class. 
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in  that  it  just  makes  an  arbitrary  selection  of  a  rule  from  among 
the  conflict  set. 

The  last  method  listed  by  Barr  and  Feigenbaum  [4]  involves 
not  making  a  selection  at  all,  but  instead,  exploring  all  the 
applicable  rules  in  parallel.  This  is  the  strategy  employed  in 
the  expert  system,  MYCIN  [1].  In  this  technique,  the  resolution 
method  is  not  based  on  the  alternative  instantiations,  but  on  the 
effects  that  each  alternative  produces  when  fired.  Such  a 
strategy  would  be  difficult  to  implement  and  would  still  require 
some  means  to  differentiate  the  solutions  that  each  line  of 
reasoning  determines.  For  physical  systems,  like  a  process 
control  system,  this  strategy  would  require  a  simulator  to 
explore  the  effects  of  each  applicable  rule  in  the  conflict  set. 
This  strategy  does  not  belong  to  any  of  the  five  main  classes  of 
conflict  resolution  strategies.  However,  it  could  be  implemented 
using  a  technique  based  on  meta-knowledge,  the  topic  to  be 
discussed  next. 

A  new  strategy  class,  not  yet  discussed,  involves  the  use  of 
meta-rules  to  resolve  conflicts.  A  meta-rule  is  one  form  of  meta¬ 
level  knowledge  [5].  Simply  put,  meta-knowledge  is  knowledge 
about  the  system  knowledge  (object-level  knowledge) .  In  the 
context  of  conflict  resolution,  the  meta-rules  would  contain 
knowledge  that  could  be  used  to  make  a  selection  from  among  the 
alternatives  within  the  conflict  set.  In  this  regards,  meta-rules 
would  qualify  as  a  selection  strategy  for  conflict  resolution. 

The  knowledge  contained  in  the  meta-rules  can  encompass 
general  process  knowledge  or  even  address  the  current  process 
state.  Therefore,  the  meta-rules  can  be  applied  to  reason  about 
the  usefulness  of  the  knowledge  that  each  alternative  within  the 
conflict  set  provides.  This  results  in  the  selection  of  the 
alternative (s)  most  appropriate  for  the  given  context.  Example 
meta-rules  for  a  batch  process  might  state: 

IF  (  in  the  cooldown  sequence  ) 

THEN  (  ignore  all  rules  related  to  heat  effects  ) 

or  possibly, 

IF  (  current  temperature  >  maximum  ) 

THEN  (  ignore  all  rules  requesting  temperature  increases  ) 

Meta-rules  differ  from  the  other  heuristic  strategies.  The  other 
strategies  do  not  consider  the  usefulness  of  the  alternative, 
they  instead  use  a  fixed  set  of  rules  that  consider  the 
characteristics  possessed  by  each  alternative  (age,  placement, 
number  of  preconditions,  similarity  to  other  knowledge  elements, 
etc . )  . 

One  important  point  about  meta-rules  is  that  they  do  not 
make  any  conclusions  about  the  validity  of  the  alternative  rules 
in  the  conflict  set.  They  only  make  a  decision  based  on  the 
utility  of  the  rules.  Consequently,  the  knowledge  provided  by  the 
meta-rules  cannot  be  embodied  within  the  object  level  rules 
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themselves.  The  meta-level  knowledge  used  by  the  conflict 
resolution  system  would  be  separate  from  the  object  level 
knowledge  in  the  knowledge  base. 

The  selectivity  of  the  meta-rules  is  dependent  on  the  detail 
provided  by  the  meta-knowledge  they  embody.  It  would  be  difficult 
to  provide  the  knowledge  necessary  to  deal  with  all  possible 
conflicts;  therefore,  this  strategy  is  best  applied  in 
combination  with  another.  One  advantage  provided  by  the  use  of 
meta-rules,  is  that  the  resolution  strategy  is  stated  explicitly 
in  rule  form.  This  aids  in  the  editing  and  maintenance  of  the 
system  and  provides  a  means  for  explaining  system  behavior. 

In  an  article  by  Sauers  and  Walsh  [6],  they  divide  conflict 
resolution  strategies  into  two  major  types:  elimination 
strategies  and  selection  strategies.  Elimination  strategies  are 
those  methods  which  rule  out  the  consideration  of  certain 
alternatives  based  on  some  criteria.  The  selection  strategies  are 
those  methods  that  will  pick  from  among  the  alternatives  those 
deemed  acceptable.  Using  these  definitions  it  is  possible  to 
further  divide  the  six  classes  of  conflict  resolution  strategies, 
defined  earlier,  into  these  two  major  categories  (see  Figure 
2.2)  . 

Sauers,  and  Walsh,  believe  that  when  a  conflict  resolution 
strategy  is  used  in  a  real-time  knowledge-based  system  (such  as 
process  control) ,  it  should  possess  additional  capabilities  to 
increase  the  efficiency  of  the  system.  They  state  that  the 
standard  methods  of  conflict  resolution  are  flawed  in  that  they 
are  "unaware  of  the  characteristics  of  the  system  environment, 
and  are  thus  unresponsive  to  changes  in  this  environment"  [6].  If 
the  conflict  resolution  system  were  to  take  into  account  system 
performance  statistics,  it  could  use  these  statistics  to  aid  in 
selecting  the  optimum  alternative.  Sauers  and  Walsh  [6],  go  on  to 
mention  that  conflict  resolution  strategies  "should  also  be 
allowed  to  address  real-time  constraints  by  enabling  productions 
themselves  to  alter  the  conflict  resolution  strategy  used  in 
critical  situations."  They  also  propose  the  use  of  meta-conflict 
resolution  procedures  to  test  the  validity  of  possible 
instantiations  before  they  are  placed  within  the  conflict  set, 
hoping  to  reduce  the  number  of  required  instantiations.  No  detail 
is  provided  on  how  these  capabilities  would  be  implemented. 


2.2.2  Numeric  Techniques 

In  addition  to  the  methods  discussed  above,  there  exist 
numeric  methods  of  conflict  resolution.  Ferrante  [7]  states  that 
"conflict  resolution  has  been  treated  by  a  number  of  different 
authors  who  use  various  techniques  and  terminology  such  as 
reasoning  about  uncertainty  or  'the  integration  of  knowledge  from 
disparate  sources ' . "  He  found  that  most  work  in  the  area  of 
conflict  resolution  has  focused  on  the  use  of  numeric  approaches 
to  conflict  resolution,  such  as  Bayesian  or  Dempster-Shafer 
techniques. 
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Conflict  Resolution  Strategies 


Elimination  Strategies  Selection  Strategies 


Figure  2.2:  Classes  of  Heuristic  Conflict  Resolution 

Strategies. 


The  numeric  methods  of  Bayesian  and  Dempster-Shafer  are 
techniques  for  reasoning  about  the  uncertainty  of  the  knowledge 
applied  to  solve  the  problem.  These  techniques  provide  a  method 
for  propagating  the  certainty  in  a  line  of  reasoning.  Therefore, 
the  result  of  using  this  type  of  numeric  method  would  be  a 
conflict  set  where  each  instantiation  has  a  certainty  value 
associated  with  it.  These  priority  values  can  then  be  used  to 
order  the  alternatives  within  the  conflict  set,  and  to  perform 
the  elimination  and  selection.  Therefore,  the  conflict  resolution 
method  would  encompass  the  Bayesian  or  Dempster-Shafer 
calculations  along  with  a  procedure  used  to  order  the 
alternatives . 

These  numeric  methods  of  conflict  resolution  are  similar  to 
the  "priority  rule",  mentioned  in  Barr  and  Feigenbaum  [4].  The 
priority  values  represent  the  resulting  belief  values  calculated 
for  each  alternative  within  the  conflict  set.  Compared  with  the 
heuristic  strategies  discussed  previously,  these  numeric  methods 
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attempt  to  apply  a  more  formalistic  strategy  for  determining  the 
worth  of  each  alternative  within  the  conflict  set.  These  methods 
utilize  belief  functions  to  characterize  the  worth  of  a  deducted 
conclusion  (rule  consequent) .  Using  the  belief  value  associated 
with  each  alternative  in  the  conflict  set,  the  members  can  be 
ordered.  This  idea  is  similar  to  the  heuristic  strategy  which 
uses  priority  values  as  the  criteria  for  ordering.  The  difference 
between  the  two  is  in  the  method  used  to  obtain  the  criteria 
(belief  values)  for  those  rules  within  the  conflict  set.  In  the 
heuristic  method,  the  priority  values  associated  with  a  rule  are 
constant  and  not  dependent  on  the  line  of  reasoning  used  to 
arrive  at  an  alternative  solution.  In  the  numeric  approach,  the 
belief  functions  are  dependent  on  the  evidence  provided.  The 
system  propagates  the  belief  values  through  the  reasoning  and  the 
final  values  obtained  depend  on  the  evidence  used  to  arrive  at 
the  conclusions.  Using  the  final  belief  values,  the  system  will 
order  the  alternatives  and  then  make  a  selection.  Since  the 
propagation  of  these  belief  values  is  part  of  the  reasoning 
technique,  the  function  of  the  conflict  resolution  system  becomes 
more  an  integral  part  of  the  "recognize"  phase  and  is  not  as 
distinct  in  its  operation,  as  for  the  heuristic  methods. 

The  difference  among  the  various  numeric  methods  of  conflict 
resolution  is  the  mathematical  technique  used  to  propagate  the 
belief  values  used  in  the  reasoning  of  the  system.  The  two  basic 
methods  involve  the  use  of  the  Bayesian  or  Dempster-Shafer 
technique  for  propagation  of  the  belief  values  in  the  reasoning 
of  the  system  [8] [9].  Bayes'  rule  represents  the  belief  function 
using  statistical  probabilities.  Here,  the  probability 
establishes  the  belief  in  the  deduction  made  based  on  the 
probabilities  of  the  evidence  used  (i.e.,  preconditions).  The 
weakness  of  this  approach  lies  in  the  requirement  that  the 
conditional  probabilities  concerning  system  states  be  available 
'a  priori ' . 

Dempster-Shafer  calculus  provides  a  means  for  propagating 
belief  values  in  a  more  general  manner  than  the  Bayesian 
approach.  The  Dempster-Shafer  technique  allows  for  multiple 
beliefs  in  that  it  does  not  require  that: 

Prob (event  "a"  occurs)  +  Prob (event  "a"  does  not  occur)  =  1 

as  is  the  case  in  Bayes'  approach.  The  disadvantage  arises  in 
that  the  Dempster-Shafer  method  is  more  complicated  to  apply  than 
the  Bayesian  approach. 

These  two  methods  represent  only  the  basic  techniques  that 
can  be  used  for  propagating  belief  functions.  Examples  of  other 
numeric,  and  heuristic,  strategies  of  conflict  resolution  will  be 
given  in  the  next  section. 
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2.2.3 


Implementations  and  Extensions  of  the  Basic  Strategies 

In  the  following  discussion,  five  example  implementations  of 
conflict  resolution  techniques  are  presented.  These  techniques 
represent  possible  extensions  or  combinations  of  the  basic 
strategies  discussed  in  the  previous  sections. 

An  automatic-programming  system,  PECOS,  developed  by  David 
Barstow  and  discussed  in  Bar  and  Feigenbaum  [4],  uses  a 
combination  of  techniques  for  conflict  resolution.  This 
implementation  is  unique  in  its  approach  to  conflict  resolution, 
in  that  it  employs  one  of  three  strategies  depending  on  the 
current  situation.  For  example,  if  PECOS  is  interacting  directly 
with  the  user,  then  the  user  has  the  power  to  resolve  the 
conflict.  If  the  user  chooses  not  to  make  the  selection,  then 
PECOS  will  make  a  selection  of  one  of  the  applicable  rules  using 
heuristics  (meta-rules)  to  determine  the  alternative  which 
produces  the  most  efficient  implementation.  The  heuristics  are 
able  to  handle  about  two-thirds  of  all  the  possible  cases  which 
arise.  In  the  event  the  user  does  not  make  a  selection  and  the 
heuristics  do  not  apply,  then  PECOS  has  the  capability  to  apply 
each  alternative  in  parallel.  This  results  in  several  solutions 
which  can  be  compared  directly  in  terms  of  some  goal  criteria. 

Another  variation  of  the  basic  heuristic  conflict  resolution 
strategies  can  be  found  in  the  language  YES/Ll  [10],  YES/Ll  is 
designed  for  use  with  real-time  applications  and  provides  a 
method  for  the  integration  of  rule-based  programming  and 
procedural  programming  in  IBM  environments.  In  general,  YES/Ll 
combines  the  heuristic  strategies  of  production  order,  recency, 
special  case,  arbitrary  decision,  and  distinctiveness,  to 
formulate  the  overall  method  of  resolving  conflicts.  As  a  first 
step  the  system  employs  the  use  of  assigned  priority  values  for 
each  rule  to  provide  initial  ordering.  If  a  tie  results,  then  the 
system  selects  the  instantiation  that  was  most  recently  created. 
If  a  tie  still  exists,  then  the  system  selects  the  "most 
specific"  rule,  defined  as  the  rule  with  the  greater  number  of 
preconditions.  As  a  last  resort  to  resolving  remaining  conflicts, 
the  system  will  make  a  random  selection.  After  a  rule  is 
instantiated  once,  it  is  not  allowed  to  fire  again  until  the  data 
in  the  instantiation  changes  in  value.  An  instantiation  is 
automatically  re-enabled  for  firing  only  when  its  rule  refers  to 
an  attribute  (in  the  antecedent)  that  has  changed  since  the 
previous  firing.  This  eliminates  the  repeated  firing  of  a  rule 
and  requires  that  the  rule’s  precondition  be  found  false  before 
it  can  again  be  instantiated.  In  addition  to  these  capabilities, 
YES/Ll  allows  the  programmer  to  access  the  conflict  set  and 
program  a  custom  strategy  for  conflict  resolution. 

Garvey,  et  al.  [11] »  discusses  a  numeric  approach  to 
conflict  resolution  which  is  an  extension  of  Shafer’s 
mathematical  theory  of  evidence.  The  resulting  technique,  like 
the  Dempster-Shafer  method,  is  more  general  than  either  a  Boolean 
or  Bayesian  approach.  Garvey’s  purpose  was  to  develop  a  system 
which  could  interpret  the  important  aspects  of  a  system's  state 
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based  on  information  obtained  from  different  sources,  and 
integrate  this  knowledge  to  arrive  at  the  appropriate  conclusion. 

Another  possible  method  for  conflict  resolution,  combines 
heuristic  and  numeric  techniques  for  use  in  systems  employing 
multiple  knowledge  sources  [7].  The  conflict  resolution  strategy 
integrates  the  concept  of  reasoning  about  uncertainty,  to 
determine  the  relative  validity  of  beliefs,  with  the  concept  of 
constraint  propagation,  which  uses  contradictions  to  restrict  the 
range  of  applicable  choices.  This  new  strategy  is  termed 
"characteristic  error”  conflict  resolution  and  is  unique  in  its 
"method  of  treating  each  knowledge  source  as  a  separate  entity 
whose  validity  is  determined  only  from  the  information  the 
knowledge  source  itself  provides"  [7].  The  idea  is  that  within  a 
multi-source  system,  each  individual  knowledge  source  is,  to  some 
extent,  the  best  judge  of  its  own  performance. 

When  a  using  a  shell  to  develop  a  knowledge-based  system, 
the  user  must  contend  with  the  strategies  provided  by  the  vendor. 
Sometimes  the  shell  will  allow  the  user  to  implement  a  custom 
strategy  or  develop  a  combination  of  strategies  from  those 
provided.  As  an  example,  consider  GEST,  a  generic  expert  system 
tool,  from  Georgia  Tech's  Research  Institute  [12].  GEST  allows 
the  user  to  customize  the  conflict  resolution  strategy.  GEST 
supports  six  different  strategies  and  allows  for  prioritized 
combinations  of  these  strategies  to  be  employed.  The  conflict 
resolution  strategies  provided  are  [12]: 

1.  complexity  -  fire  the  most  complex  rule 

2.  simplicity  -  fire  the  simplest  rule 

3.  recency  -  fire  the  most  recently  used  rule 

4.  reverse  recency  -  fire  the  least  recently  used  rule 

5.  antecedent  ordered  -  fire  the  rule  with  the  highest 

antecedent  priority. 

6.  consequent  ordered  -  fire  the  rule  with  the  highest 

consequent  priority. 

Each  of  these  strategies  can  readily  be  categorized  into  the  one 
of  the  six  main  classes  as  presented  by  McDermott  and  Forgy  [3]. 

Based  on  the  above  findings,  it  is  evident  that  there  is  no 
single  method  for  conflict  resolution.  Many  strategies  are 
possible,  and  the  selection  of  the  appropriate  strategy  should  be 
dependent  on  the  application. 


82-15 


CHAPTER  3 


KNOWLEDGE-BASED  PROCESS  CONTROL  SYSTEMS 


3 . 1  Introduction 

The  application  of  knowledge-based  systems  to  process 
control  is  a  relatively  new  area  of  research,  but  one  that  is 
gaining  the  attention  of  researchers  in  both  the  academic  and 
industrial  environments  [13].  This  interest  arises  because  both 
the  manufacturing  and  process  industries  have  complex  production 
management  and  control  problems  which  current  analytical  methods 
are  often  unable  to  solve,  or  too  difficult  to  deal  with  at  the 
production  level.  The  operation  of  these  complex  processes 
usually  involves  a  human  operator  who  relies  on  the  use  of 
empirical  knowledge  gained  from  operational  experience.  Due  to 
their  experience,  it  is  possible  for  these  operators  to  provide 
near-optimal  operation  in  terms  of  the  product's  quality  and 
consistency.  Processing  problems  arise  when  more  than  one 
operator  is  responsible  for  a  given  process.  The  differences  in 
the  methods  of  control  employed  by  each  operator  results  in 
variations  in  product  quality  and  material  consumption.  These 
variations  in  the  product,  and  the  inconsistency  in  the  applied 
control  strategy,  are  two  primary  reasons  for  seeking  to  automate 
the  control  of  these  ill-understood  processes  [14]. 

The  knowledge-based  process  control  system  provides  a  way  to 
implement  a  consistent  control  strategy  that  employs  both  the 
heuristic  knowledge  used  by  the  operator  and  the  deterministic 
knowledge  from  control  theory.  These  knowledge-based  control 
systems  can  be  used  to  solve  a  whole  host  of  problems  where 
conventional  methods  are  inadequate  -  for  example,  where  no 
mathematical  or  algorithmic  solution  exists  because  data  is 
inexact,  uncertain,  or  based  on  probabilities. 

In  the  discussion  which  follows,  the  general  characteristics 
of  a  knowledge-based  system,  as  applied  to  process  control,  will 
be  presented.  Discussion  begins  by  defining  the  type  of  control 
applied  to  batch  and  continuous  processes.  This  section  is 
followed  by  a  presentation  of  the  functions  that  a  knowledge- 
based  program  can  perform  when  used  in  the  process  control 
environment.  The  last  two  sections  include  discussions  on  the 
structure  of  the  knowledge-based  control  system,  and  the 
operational  requirements  for  the  knowledge-based  program  when 
used  in  the  dynamic  process  control  environment. 


3 . 2  Continuous  and  Batch  Control  Systems 

The  purpose  of  a  process  control  system  is  to  maintain 
control  of  the  process  within  specified  operational  constraints. 
The  method  used  to  control  the  process  is  dependent  on  whether 
the  process  operates  in  a  continuous  or  batch  mode.  A  continuous 
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process  is  perceived  as  a  process  that  continues  to  operates 
until  either  no  more  product  is  desired,  or  some  failure  causes  a 
shutdown.  Once  the  process  is  up  and  running,  the  system  will 
experience  upsets  in  the  process  due  to  disturbances  (e.g., 
changes  in  the  feedrate,  feed  composition,  and  steam  pressure, 
etc.}.  The  controller  compensates  for  these  disturbances  by 
changing  the  values  of  the  manipulative  variables.  This  type  of 
control  is  referred  to  as  "regulatory*'  control.  The  overall 
objective  is  to  maintain  the  control  variables  at  their  set- 
points.  For  the  start-up  and  shutdown  of  the  process,  the  system 
requires  different  methods  of  control.  These  methods  encompass 
techniques  ranging  from  manual  control  to  methods  of  "servo" 
control  where  the  set-point  is  adjusted  and  used  to  drive  the 
process  to  the  desired  state.  Since  these  transient  stages 
(start-up  and  shutdown)  represent  only  a  small  percentage  of  the 
operating  cycle  of  a  continuous  process,  the  strategies  for  these 
steps  are  not  considered  to  be  an  integral  part  of  the  control 
system  and  are  handled  by  the  engineering  staff. 

On  the  other  hand,  in  batch  processes  the  transient  phases 
of  start-up  and  shutdown  can  be  a  major  part  of  the  process 
cycle,  and  therefore,  control  of  these  phases  of  the  process 
become  important.  In  addition,  sometimes  a  batch  process  is 
sequenced,  that  is,  the  process  is  driven  to  proceed  through  a 
series  of  process  steps  from  start  to  completion.  For  these 
processes  servo  control  is  the  major  method  of  control.  Examples 
of  batch  processes  include  the  blending  of  ingredients  in  the 
food  industry,  the  mixing  of  chemicals  to  form  a  reaction  in  a 
reactor,  and  composite  lamination  in  an  autoclave. 


3 . 3  Functions  of  the  Knowledae-based  Process  Control  System 

In  addition  to  fundamental  control  of  the  process, 
researchers  have  developed  and  commercialized  methods  for 
providing  additional  capabilities  within  the  control  system. 
These  capabilities  include  techniques  for  self-tuning  control  and 
process  optimization.  Industry  was  and  still  is  slow  to  accept 
these  methods  because  they  are  rigorous  and  complex.  With  the 
introduction  of  knowledge-based  systems  in  the  area  of  process 
control,  researchers  immediately  began  to  investigate  using  the 
new  technology  to  provide  these  same  extended  capabilities  (self¬ 
tuning  control,  process  optimization)  and  more.  The  key  was  that 
instead  of  having  to  characterize  the  system  mathematically,  the 
system  could  be  modeled  heuristically  and  the  knowledge  used  by 
the  operator  could  be  modeled  and  applied  to  control  the  process. 
The  functions  that  these  new  knowledge-based  systems  can  provide 
include: 

1.  self-tuning  control  of  an  inner  loop  controller, 

2.  optimization  of  the  controlled  process, 

3.  process  monitoring,  diagnosis,  and  alarm  advising,  and 

4.  assistance  in  process  start-up  and  shut-down. 
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Therefore,  the  idea  of  applying  knowledge-based  systems  to 
process  control  has  prompted  a  reexamination  of  old  problems. 


Knowledge-Based  Process  Control  System 


Figure  3.1:  Block  diagram  of  a  knowledge-based  process 
control  system. 


3 . 4  The  Structure  of  a  Knowledge-Based  Process  Control  System 

The  fundamental  structure  of  a  knowledge-based  process 
control  system  can  be  represented  by  the  block  diagram  shown  in 
Figure  3.1.  The  knowledge-based  component  is  interconnected  to 
the  process  in  a  feedback  loop.  The  function  of  the  knowledge- 
based  component  is  to  observe  the  output  of  the  process  (through 
the  sensor  values)  and  formulate  a  conclusion  about  the  perceived 
state  of  the  process.  Then  the  knowledge-based  program  will  make 
a  decision  on  how  to  react  (manipulate  the  process)  to  position 
the  process  at  a  desired  state.  The  method  in  which  these 
functions  are  implemented  varies  dependent  on  the  application 
process,  and  the  level  of  interaction  between  the  process  and  the 
knowledge-based  system. 
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Operational  Constraints 


Figure  3.2:  Block  diagram  of  a  knowledge-based  process  control 
system  with  the  knowledge-based  controller  in  a  supervisory 
position. 


For  each  type  of  process,  continuous  or  batch,  the 
knowledge-based  component  of  the  system  can  interact  with  the 
process  at  two  different  levels.  At  the  lowest  level,  the 
knowledge-based  component  will  directly  control  the  process  by 
manipulation  of  the  final  control  elements  through  interface 
hardware.  The  second  level  of  interaction,  places  the  knowledge- 
based  portion  of  the  system  outside  of  the  feedback  loop.  The 
knowledge-based  program  then  issues  commands  to  a  conventional 
controller  within  the  feedback  loop  (see  Figure  3.2).  This  places 
the  knowledge-based  program  in  a  supervisory  mode  and  alleviates 
some  of  the  constraints  on  the  design  requirements  of  the 
knowledge-based  program  itself.  In  this  configuration,  the 
knowledge-based  program  will  interact  with  the  controller 
providing  adjustments  to  set-point  and/or  tuning  parameters.  At 
either  level  of  interaction,  the  knowledge-based  program  provides 
an  interface  for  the  operators  to  interact  with  the  process. 
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The  knowledge-based  portion  of  the  control  system  is 
composed  of  several  major  components  which  include  the  knowledge 
base,  the  control  program,  and  working  memory  (see  Figure  3.3). 
The  control  program  directs  the  reasoning  process  of  the  system 
using  the  knowledge  base  and  the  evidence  contained  in  working 
memory.  This  reasoning  results  in  control  actions  that  the 
program  issues  directly  to  the  controller  (or  the  control 
elements  directly) .  The  knowledge  base  is  composed  of  facts  and 
heuristics.  "The  'facts'  constitute  a  body  of  information  that  is 
widely  shared,  publicly  available,  and  generally  agreed  upon  by 
experts  in  a  field.  The  'heuristics'  are  mostly  private,  little- 
discussed  rules  of  good  judgment  (rules  of  plausible  reasoning, 
rules  of  good  guessing)  that  characterize  expert-level  decision 
making  in  a  field"  [15].  Working  memory  is  a  dynamic  database 
that  is  used  to  store  evidence  (proposed  facts)  about  the  system 
that  are  currently  true.  These  facts  can  be  anything  from  the 
variable  values  derived  from  the  sensors  to  information  about  the 
process  states  that  the  system  has  obtained  in  its  reasoning. 


Figure  3.3:  Block  diagram  of  a  knowledge-based  program. 
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A  knowledge-based  process  control  system  differs  from  a 
conventional  control  system  in  several  ways.  One  difference  is 
that  a  knowledge-based  process  control  system,  uses  symbolic 
knowledge  to  reason  about  the  process  as  opposed  to  the 
conventional  means  of  mathematical  equations.  A  second  difference 
is  that  the  knowledge  is  used  to  emulate  the  behavior  of  the 
operator  rather  than  the  process.  A  third  difference  is  that  the 
knowledge-based  control  system  will  operate  in  an  event-driven 
mode  as  opposed  to  the  data-driven  mode  of  conventional 
controllers.  Finally,  in  a  conventional  controller  the  knowledge 
pertinent  to  the  process  and  the  methods  for  using  this  knowledge 
are  intertwined.  Whereas,  in  a  knowledge-based  system,  these  two 
components  are  separate,  thus  allowing  for  ease  in  maintenance 
and  modification. 

The  knowledge-based  process  control  system  provides  a  means 
of  applying  a  consistent  control  strategy  using  experiential 
knowledge,  assuming  that  such  knowledge  is  available.  This 
consistent  control  strategy  satisfies  the  goal  of  reducing 
product  variation  and  improving  product  quality. 


3 . 5  Operational  Requirements 

In  a  production  environment,  the  data  used  in  controlling 
the  process  is  dynamic  in  nature.  This  operating  environment  is 
different  from  that  of  other  knowledge-based  systems 
applications.  Because  of  the  differing  environments,  the 
knowledge-based  program  must  possess  additional  capabilities  in 
terms  of  its  operating  characteristics.  These  capabilities  will 
include  [13]: 

.  providing  time  critical  responses, 

.  providing  non-monotonic  reasoning, 

.  performing  in  continuous  operation, 

.  catering  to  asynchronous  events, 

.  interfacing  to  external  environments, 

.  dealing  with  missing  information, 

.  providing  temporal  reasoning,  and 
.  providing  integration  with  procedural  routines. 

In  the  discussion  that  follows  each  of  these  characteristics  is 
addressed  with  reference  to  a  knowledge-based  process  control 
system. 

A  knowledge-based  system  must  operate  within  the 
requirements  set  forth  by  the  process.  Some  processes  require 
response  times  in  milliseconds  while  some  can  wait  hours  for  a 
response.  Therefore,  the  response  time  requirement  for  a 
knowledge-based  control  system  will  depend  on  the  application  of 
interest.  When  the  knowledge-based  system  responds  within  the 
limits  of  the  process,  it  can  be  considered  to  be  a  real-time 
system.  This  follows  the  description  of  Laffey,  et  al.  [13]: 
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"If  given  an  arbitrary  input  (or  event)  and  an 
arbitrary  state  of  the  system,  the  system  always 
produces  a  response  by  the  time  it  is  needed  then  the 
system  is  said  to  be  real-time.” 

In  a  production  process,  the  variables  are  continuously 
changing  in  value.  Facts  about  the  state  of  the  process  will 
change  from  one  sample  period  to  the  next.  Therefore,  any  facts 
that  are  derived  based  on  a  previous  state  may  no  longer  hold. 
Thus  the  knowledge-based  program  must  retract  assertions  from 
working  memory  that  are  no  longer  true.  Such  reasoning  is  labeled 
non-monotonic . 

Most  of  the  current  applications  of  knowledge-based  systems 
are  in  the  form  of  expert  systems  that  are  used  in  a  batch  mode. 
That  is,  the  program  is  executed  for  a  specific  purpose  which  has 
a  definite  stopping  point.  When  a  knowledge-based  process  control 
system  is  attached  to  a  continuous  process,  it  must  operate 
continuously.  If  an  error  in  processing  arises,  or  insufficient 
data  are  available,  the  system  must  continue  to  provide 
consistent  control  of  the  process. 

In  a  manufacturing  process  unscheduled  events  can  occur  at 
any  time.  These  events  include  such  problems  as  tool  failure, 
equipment  failure,  and  workpiece  problems.  Similarly,  in  a 
production  environment  unscheduled  events  can  occur  due  to  sensor 
failure,  and  certain  process  problems  (e.g.,  fires,  leaks,  etc.). 
A  knowledge-based  process  control  system  must  be  able  to  handle 
interrupts  in  its  normal  cycle  of  operation  to  attend  to 
asynchronous  (emergency)  events. 

In  most  current  applications  of  knowledge-based  systems,  the 
system  assimilates  its  data  from  a  database  or  by  querying  the 
user.  In  a  process  control  application,  the  knowledge-based 
system  will  obtain  a  majority  of  its  data  (some  may  still  reside 
in  databases  or  the  user)  through  an  interface  to  the  external 
environment.  The  system  must  be  able  to  read  in  sensor  data,  and 
store  and  manipulate  this  data  for  use  in  reasoning.  This  can 
present  additional  problems  due  to  sensor  degradation,  or 
failure. 

Since  the  process  state  changes  over  time,  the  knowledge- 
based  system  must  have  some  capability  to  reason  about  the  past, 
present,  and  future.  Therefore,  the  knowledge-based  process 
control  system  must  be  capable  of  dynamic  and  temporal  reasoning. 
The  system  must  reason  about  transient  data  and  transient  process 
states.  The  system  must  also  accommodate  delays  in  initiating 
actions  and  obtaining  responses. 

The  knowledge-based  system  must  also  have  access  to 
procedural  routines  as  provided  in  conventional  programming.  The 
system  will  use  these  procedural  routines  to  perform  such  duties 
as  data  compression,  signal  processing,  certain  I/O  related 
functions,  and  some  numerical  methods  processing  (equation 
solving,  optimization,  etc.). 

These  characteristics  are  not  an  all  inclusive  list  of  the 
required  capabilities  for  a  knowledge-based  process  control 
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system.  Some  system  will  require  more  capabilities  than  these, 
and  some  might  require  less.  The  needs  of  the  system  are  dictated 
by  the  application  of  interest. 
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CHAPTER  4 


CONFLICT  RESOLUTION  AND  KNOWLEDGE-BASED  PROCESS  CONTROL 

4 . 1  Introduction 

This  chapter  will  present  and  discuss  the  concepts  of 
conflict  resolution  from  the  viewpoint  of  knowledge-based  process 
control  systems.  The  objective  of  this  discussion  is  to  provide 
the  groundwork  for  evaluating  current  techniques  for  conflict 
resolution  and  for  proposing  new  strategies. 

The  discussion  begins  with  a  brief  explanation  of  why  there 
is  a  need  for  conflict  resolution.  This  is  followed  by  a  review 
of  basic  conflict  resolution  strategies  and  their  application  to 
knowledge-based  process  control  systems.  The  chapter  is  concluded 
with  a  review  of  the  strategies  presented  in  the  literature, 
along  with  an  explanation  for  the  trends  discovered. 


4.2  Conflict  Resolution 


Most  knowledge-based  control  systems,  in  current  use, 
include  the  human  operator  as  an  integral  part  of  the  outer  loop, 
acting  as  the  interface  mechanism  between  the  knowledge-based 
system  and  the  process  [16].  This  configuration  creates  a 
knowledge-based  control  system  which  performs  as  an  open-loop 
system  for  process  diagnosis.  Such  knowledge-based  control 
systems  perform  causal  reasoning  about  process  conditions  using 
experienced-based  heuristics  developed  by  the  experienced  process 
and  control  engineers.  These  systems  diagnose  process  anomalies 
and  propose  remedies  (control  actions).  It  is  then  the  operator's 
decision  to  either  implement  one  of  the  proposed  remedies  or  use 
a  solution  of  their  own. 

The  next  technological  advancement  involves  closing  the 
outer  loop  and  allowing  the  knowledge-based  system  to  provide 
direct  control  of  the  process  (as  was  shown  in  Figures  3.1  & 
3.2).  This  design  transfers  the  decision-making  responsibility 
from  the  human  operator  to  the  knowledge-based  system.  In  this 
mode,  however,  multiple  valid  remedies  about  how  to  control  a 
problem  may  exist.  This  results  in  a  conflict  in  determining 
which  alternative  remedy  to  implement.  Consequently,  a  conflict 
resolution  system  is  needed. 


4 . 3  Nature  of  the  Conflicting  Alternatives 

A  conflict  resolution  strategy  operates  on  the  alternatives 
within  the  conflict  set.  Therefore,  the  application  of  the 
strategy  is  dependent  on  the  type  of  alternatives.  Within  the 
process  control  environment  there  are  different  types  of 
alternatives  dependent  on  the  characteristics  of  the  knowledge- 
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based  program.  In  this  section,  these  characteristics  and  the 
types  of  alternatives  are  presented. 

The  first  question  becomes:  “Is  the  type  of  alternative 
dependent  on  the  level  of  interaction  (supervisory  or  direct) 
between  the  knowledge-based  program  and  the  process?”  If  the 
program  is  used  in  a  supervisory  position,  then  the  alternatives 
will  make  absolute  or  relative  changes  to  the  set-point,  or 
controller  parameters.  If  the  program  is  used  directly  in  the 
feedback  loop,  then  the  alternatives  will  be  commands  to 
manipulate  the  final  control  elements.  Again,  these  actions  will 
be  absolute  or  relative  changes  in  the  levels  of  those  control 
elements.  So,  for  both  levels  of  control  interaction,  the  nature 
of  the  alternatives  is  identical  in  that  they  both  prescribe 
relative,  or  absolute,  changes  in  a  parameter.  However,  the 
nature  of  the  alternatives  would  differ  for  knowledge-based 
programs  that  differ  in: 

the  numi ar  of  levels  of  reasoning  used  by  the  system,  and 
the  capabilities  implemented  in  the  system  (self-tuning 
control,  optimization,  alarm  advising,  etc.) 

First,  consider  the  number  of  levels  of  reasoning  used  by 
the  system.  The  number  of  levels  (depth)  of  reasoning  is  defined 
by  how  many  "recognize-act”  cycles  the  system  uses  to  arrive  at  a 
conclusion  based  on  the  initial  evidence  obtained.  For  example, 
consider  a  rule-based  knowledge  representation  where  forward 
chaining  is  the  method  of  inference.  A  part  of  the  knowledge  base 
might  appear  as  follows: 

RULE-1  IF  temperature  >  400. 

AND  viscosity  >  35. 

THEN  process-problem-1  is  true. 

RULE-2  IF  process-problem-1  is  true 
AND  process-problem-2  is  false 
THEN  decrease  heater-1. 

Assume  that  working  memory  initially  only  contains  the  sensor 
readings: 


.  temperature  =  425 
.  viscosity  =  35.5 

Now,  in  the  first  "recognize-act"  cycle,  the  system  instantiates 
RULE-1  and  adds  the  consequent  "process-problem-1  is  true"  to 
working  memory.  On  the  second  cycle,  RULE-2  will  be  evaluated  as 
true  and  its  action,  "decrease  heater-1,"  will  be  fired.  The 
difference  is  that  the  action  "decrease  heater-1"  is  a  command 
that  directly  affects  control  of  the  process  by  reducing  the  heat 
to  the  process  from  the  heater  labeled  "heater-1."  The  system 
required  two  cycles  to  go  from  the  basic  evidence  (sensor  values) 
to  a  conclusion  (decrease  heater-1)  that  directly  affected  the 
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control  of  the  process.  This  is  an  example  of  multiple  levels  of 
reasoning. 

When  the  consequent  of  the  rule  adds  to,  or  changes,  the 
contents  of  working  memory,  as  RULE-1  did,  then  the  consequent 
action  is  called  an  internal  command.  The  term  "internal"  is 
derived  from  the  fact  that  the  consequent  has  no  direct  affect  on 
any  part  of  the  system  outside  of  the  knowledge-based  program. 
This  differs  from  an  external  command  which  provides  the  control 
actions  by  direct  manipulation  of  the  final  control  elements  (or 
movement  of  the  set-point) . 

The  use  of  a  single  level  of  reasoning  or  multiples  levels 
of  reasoning  is  dictated  by  the  structure  of  the  knowledge 
elements  (productions,  frames,  etc.)  in  the  knowledge  base.  A 
system  would  be  limited  to  a  single  level  of  reasoning  if  every 
knowledge  element's  resulting  action  (consequent)  was  composed  of 
only  external  commands  that  directly  manipulate  the  process 
variables  used  for  control  (fuel  flow,  steam  pressure,  etc.). 
Since  the  external  command  does  not  directly  affect  working 
memory,  the  resulting  action  fired  at  one  cycle  can  never 
directly  participate  as  a  precondition  of  the  second  cycle.  In 
this  type  of  knowledge  structure,  the  preconditions  of  the 
knowledge  elements  would  only  use  information  obtained  directly 
from  sensory  data.  This  information  would  take  the  form  of 
numeric  values  of  sensor  readings,  manipulated  sensory  values 
(calculated  first  and  second  differences  between  past  and  present 
values),  and  operating  conditions  input  by  the  operators.  For  a 
knowledge-based  system  of  this  type  (single  level  of  reasoning) , 
the  alternatives  within  the  conflict  set  will  all  be  of  the  same 
type.  That  is,  they  will  all  be  external  commands  that  directly 
affect  some  external  variable.  This  means  that  the  conflict 
resolution  system  will  be  handling  alternatives  of  the  same  type, 
thus  reducing  the  complexity  of  comparing  and  ordering  the 
alternatives. 

In  a  system  employing  multiple  levels  of  reasoning,  the 
resulting  actions  of  a  knowledge  element  can  be  composed  of 
either  an  external  command,  internal  command,  or  both.  Some 
cycles  will  produce  external  commands,  and  some  will  not. 
Therefore,  the  conflict  resolution  system  must  be  designed  to 
handle  the  possibility  of  a  mixture  of  command  types.  This 
presents  a  problem  in  determining  an  appropriate  means  to  compare 
the  worth  of  alternatives  that  sejrve  different  purposes  (the 
external  command  directly  affect  the  process,  while  the  internal 
commands  affect  the  perceived  state  of  the  process) . 

The  nature  of  the  alternatives  will  also  be  different  for 
systems  that  incorporate  additional  capabilities  into  the  control 
system  (like  optimization,  alarm  advising,  etc.).  Again,  these 
systems  can  be  structured  to  reason  on  one  or  more  level. 
Consider  first  a  multiple  level  system.  For  this  system,  internal 
and  external  commands  still  exist,  but  now  they  serve  different 
purposes  (affect  either  control,  optimization,  or  some  other 
factor  like  shutdown) .  This  creates  a  new  mixture  of  alternatives 
within  the  conflict  set.  Some  alternatives  might  be  concerned 
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with  control,  some  with  optimization,  and  some  with  process 
advising.  For  the  single  level  system,  the  conflict  set  only  see 
external  commands,  but  due  to  the  additional  capabilities,  the 
purposes  of  these  commands  may  also  be  different  (might  be  for 
control,  optimization,  or  process  advising) . 

Therefore,  the  nature  of  the  alternatives  contained  in  the 
conflict  set  of  a  knowledge-based  process  control  program  will 
vary.  They  can  range  from  a  simple  set  of  external  commands  with 
one  purpose,  to  a  complex  set  of  internal  and  external  commands 
with  different  purposes. 


4.4  Applicable  Strategies  for  Conflict  Resolution 

In  this  section,  each  of  the  numeric  and  heuristic  classes 
of  conflict  resolution  strategies,  discussed  in  Chapter  2,  will 
be  examined  for  its  applicability  within  the  framework  of  a 
knowledge-based  process  control  system.  Because  production  rules 
and  rule-based  frames  are  the  typical  method  for  knowledge 
representation  in  knowledge-based  process  control  applications, 
the  discussion  will  be  based  on  these  representations. 


4.4.1  Heuristic  Strategies 

PRODUCTION  ORDER;  In  a  process  control  system,  the  "production 
order"  strategy  class  can  be  applied  by  constructing  the 
knowledge  base  such  that  the  most  important  rules  are  placed  in  a 
priority  position  (either  first  or  last  depending  on  selection 
criteria) .  For  a  single  variable  control  system,  a  small 
knowledge  base  can  be  ordered  to  provide  a  system  with  strong 
selectivity.  However,  as  the  size  of  the  knowledge  base  increases 
it  becomes  difficult  to  determine  the  exact  placement  of  a  rule 
in  relation  to  all  the  others.  This  problem  also  makes 
maintenance  of  the  knowledge  base  a  difficult  task.  One  possible 
solution  is  to  divide  the  rules  into  grourvs  based  on  the 
applicability  of  the  rule  to  a  part  (or  phase,  or  state)  of  the 
process.  Then  the  rules  can  be  prioritized  within  each  group. 
Complications  arise  when  a  rule  can  be  identified  as  belonging  to 
more  than  one  group. 

For  a  multivariable  control  system,  the  task  of  prioritizing 
the  rules  in  the  knowledge  base  is  also  a  problem.  Compared  to 
the  single  variable  control  system,  the  task  is  more  complicated 
because  of  the  possibility  of  the  rules  addressing  more  than  one 
control  variable.  It  is  not  an  easy  task  to  prioritize  the 
knowledge  to  reveal  all  the  correct  priorities  for  every  possible 
relationship  that  exists  between  the  knowledge  elements.  Again, 
the  grouping  of  rules  would  help,  but  it  is  still  possible  for  a 
single  rule  to  belong  to  more  than  one  group.  Prioritizing  may  be 
facilitated  using  some  additional  knowledge  representation,  such 
as  a  directed  graph,  to  express  the  relationship  between  the 
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rules  for  the  different  domains.  But  even  this  method  is 
difficult  to  implement. 

The  "production  order”  strategy  is  useful  for  applications 
involving  knowledge  bases  which  are  small  in  size,  and  where  the 
knowledge  is  static  in  nature  and  won't  require  much  editing. 
This  strategy  also  favors  single  variable  control  systems  with 
limited  amounts  of  knowledge;  or,  multivariable  control  systems, 
where  it  is  possible  to  determine  the  relationships  between  the 
rules,  and  each  rule  is  mutually  exclusive. 

Another  "production  order"  strategy  rule  discussed  earlier 
used  confidence  factors  ("priority  values")  as  the  criteria  for 
determining  the  order  within  the  conflict  set.  In  a  process 
control  application,  the  priority  values  assigned  to  a  rule  can 
represent  several  things  depending  on  the  designer.  For  example, 
a  priority  value  can  be  assigned  which  is  nothing  more  than  a 
number  which  represents  the  importance  of  that  rule  in  comparison 
to  all  other  in  the  knowledge  base.  This  implementation  is 
identical  to  the  production  order  strategy,  where  the  production 
order  is  not  based  on  physical  location,  but  on  the  priority 
number  assigned.  Another  means  of  using  the  priority  value 
entails  using  it  to  represent  the  probability  (confidence)  that 
the  affect  stated  by  the  rule  is  true  given  the  evidence  stated 
in  the  preconditions.  In  either  case,  this  strategy  requires  that 
the  knowledge  engineer  determine  and  assign  the  priority  value 
prior  to  the  use  of  the  system.  Assignment  of  the  values 
themselves  is  not  an  easy  task  and  leaves  room  for  error  in 
judging  the  worth  of  a  rule.  This  produces  the  same  weaknesses 
that  the  production  order  strategy  possesses.  The  selectivity  of 
this  strategy  is  dependent  on  the  method  used  to  assign  values. 
If  not  two  rules  possess  the  same  probability,  then  the 
selectivity  will  be  strong  and  definite.  However,  in  a  typical 
case  it  is  not  possible  to  ensure  that  no  two  rules  possess  the 
same  value.  The  selectivity  of  the  strategy  will  decrease  as  the 
chance  of  multiple  values  increases.  However,  this  strategy  is 
useful  in  that  it  provides  a  means  for  expressing  the  confidence 
that  the  designer  has  in  each  piece  of  knowledge  used  within  the 
system.  This  information  is  useful  the  knowledge  is  examined  at  a 
later  date,  or  by  someone  other  than  the  designer. 

SPECIAL  CASE;  The  next  conflict  resolution  strategy  is  the  class 
of  "special  case"  rules.  The  determination  of  the  criteria  for 
specifying  a  special  case  can  make  use  of  a  rule's  preconditions, 
and/or  the  instantiations  of  the  rule  that  result  from  the 
"match"  step.  The  selectivity,  of  this  strategy  is  dependent  on 
the  criteria  employed.  When  the  system  arrives  at  two 
alternatives  and  one  of  the  alternatives  is  a  special  case  of  the 
other,  then  it  is  safe  to  assume  that  the  action  taken  by  the 
more  specific  instantiation  will  also  compensate  for  the  more 
general  instantiation.  As  an  example,  lets  define  the  "special 
case"  criteria  to  be  the  following  [3]: 
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"Instantiation  is  a  special  case  of  instantiation 
I  ,  if  Ig  contains  as  a  proper  subset,  all  the  data 
elements  contained  in  1^" 

This  criteria  is  only  one  of  the  many  possible  criterion  that  can 
be  defined. 

To  demonstrate,  consider  an  autoclave  system  that  is  in  the 
process  of  laminating  composite  materials  (a  batch  process) . 
Assume  that  the  knowledge-based  process  control  system  has 
detected  an  accelerated  reaction  within  the  laminate.  This  event 
is  given  by  the  rule: 

IF  (  mid-layer-temp  *rising  ) 

AND  (  top-layer-temp  *rising  ) 

AND  (  mid-layer-slope  >  top-layer-slope  ) 

THEN  (  heater  *decrease  ) 

The  action  specified  by  the  system  is  to  reduce  the  heat  to  the 
autoclave  to  bring  the  reaction  under  control.  Now  if  the  system 
is  also  able  to  detect  a  fire  using  the  (fictitious)  rule: 

IF  (  mid-layer-temp  *rising  ) 

AND  (  top-layer-temp  *rising  ) 

AND  (  mid-layer-slope  >  top-layer-slope  ) 

AND  (  temp  >  fire-temp  ) 

THEN  (  autoclave  *off  ) 

AND  (  pressure  *off  ) 

AND  (  alarm  *on  ) 

Then  the  system  needs  to  stop  processing  and  sound  an  alarm  to 
prompt  immediate  action  from  the  operator.  Examination  and 
comparison  of  the  preconditions  would  reveal  that  all  the 
instantiations  of  the  "accelerated  reaction"  rule  are  a  subset  of 
the  "fire"  rule.  Therefore,  if  both  rules  were  present  within  the 
conflict  set,  the  "fire"  rule  would  be  recognized  as  a  "special 
case"  of  the  "accelerated  reaction"  rule  and  would  receive 
priority  by  elimination  of  the  "accelerated  reaction"  rule  from 
the  conflict  set.  The  action  performed  as  a  result  of  the  firing 
of  the  special  case  rule  will  also  provide  the  necessary  control 
action  for  handling  the  problem  noted  in  the  general  rule. 

Since  it  is  not  likely  that  all  the  members  of  the  conflict 
set  will  be  a  special  case  of  other  alternatives,  this  strategy 
can  be  labeled  as  being  weakly  selective.  That  is,  the  strategy 
only  has  the  capability  of  eliminating  instantiations  within  the 
conflict  set  that  are  general  cases  of  other  instantiation  in  the 
conflict  set.  This  strategy  would  also  require  care  in  the 
construction  of  the  rules  for  the  knowledge  base  to  ensure 
compatibility  in  the  specification  of  the  rule  preconditions.  It 
is  necessary  that  the  rule  be  constructed  using  only  those 
preconditions  that  are  absolutely  pertinent  to  the  identification 
and  handling  of  that  process  condition.  If  a  rule  is  constructed 
haphazardously  and  other  preconditions  are  added  that  are  not 


necessary  to  define  the  state,  then  the  selectivity  of  this 
strategy  will  be  greatly  reduced,  since  it  will  be  rare  that  a 
special  case  will  exist.  For  example,  consider,  again,  the  two 
rules  above,  ‘'accelerated  reaction”  and  "fire."  If  the 
"accelerated  reaction"  rule  was  rewritten  as: 

IF  (  mid-layer-temp  ‘rising  ) 

AND  (  top-1 ayer-temp  ‘rising  ) 

AND  (  mid-layer-slope  >  top-layer-slope  ) 

AND  (  mid-layer-slope  >0  ) 

THEN  (  heater  ‘decrease  ) 

Then  the  "fire"  rule  would  not  be  a  special  case  of  this  rule, 
due  to  the  extra  precondition  "mid-layer-slope  >  0."  This 
precondition  is  not  required  since  it  is  implicit  in  the  fact 
that  the  mid-layer-temp  is  indicated  as  "rising."  The  poor 
construction  of  the  rule  led  to  a  breakdown  in  the  selectivity  of 
the  conflict  resolution  strategy.  Although  not  easily  detected, 
this  problem  can  be  avoided  by  careful  construction  and 
examination  of  the  contents  of  each  rule. 

The  "special  case"  class  of  conflict  resolution  is 
applicable  to  all  knowledge-based  systems  in  process  control.  The 
strategy  essentially  says  that  the  system  should  favor  those 
process  states  (problems)  that  are  identified  by  rules  that  use 
more  preconditions,  because  they  are  more  exact  in  their 
identification  of  process  states.  This  type  of  logic  is 
applicable  for  systems  which  address  physical  processes. 

RECENCY t  The  third  strategy  class  of  conflict  resolution  rules 
uses  age  as  the  basis  for  ordering  alternatives  within  the 
conflict  set.  This  class  of  rules,  referred  to  as  the  "recency 
rules,"  keeps  track  of  the  amount  of  time  that  the  data  elements 
have  been  posted  to  working  memory.  Implementation  of  this 
strategy  could  favor  older  or  newer  data  elements,  depending  on 
the  needs  of  the  system.  The  method  of  determining  the  age  of  an 
instantiation  is  dependent  on  the  user.  McDermott  and  Forgy  [3], 
mention  two  possible  methods  for  defining  the  age  of  a  data 
element  in  working  memory.  The  first  defines  the  age  based  on  the 
number  of  interpreter  (recognize-act)  cycles  that  have  elapsed 
since  that  element  was  asserted.  The  second  interprets  age  as  the 
number  of  actions  that  have  fired  since  the  firing  of  the  action 
that  asserted  that  particular  data  element.  The  first  definition 
of  age  is  less  selective  since  it  is  possible  for  more  than  one 
data  element  to  be  asserted  in  a  single  cycle.  Therefore,  two 
alternatives  in  the  conflict  set  could  be  defined  as  having  the 
same  recency  criteria.  The  second  definition  is  more 
discriminating  since  no  two  data  elements  would  share  the.  same 
age. 

The  fundamental  idea  of  this  strategy  is  to  provide 
continuity  in  the  system  operation.  As  an  example,  consider  a 
process  control  system.  The  idea  behind  control  is  to  maintain 
the  specified  control  variable(s)  at  prescribed  conditions  (i.e., 
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set-points) .  Therefore,  the  firing  of  an  alternative  will 
manipulate  some  process  variable  to  correct  a  control  variable. 
On  the  next  cycle  of  the  system,  any  new  alternatives  that  are 
added  to  the  conflict  set  will  most  likely  represent  process 
conditions  that  are  the  result  of  the  application  of  the  previous 
correction.  If  the  new  data  elements  are  favored  over  the  old, 
then  the  system  continues  to  correct  for  the  original  problem. 
For  the  sake  of  clarity,  the  example  used  in  the  chapter  on 
conflict  resolution  will  be  repeated: 

"Suppose  that  on  the  first  cycle  two  rules  (R1  and  R2) 
were  instantiated  and  the  rule  R1  was  selected  to  fire. 

In  the  next  cycle,  if  rules  Rl,  R2,  and  R3  are 
instantiated,  then  the  resolution  strategy  would  select 
R3  since  this  is  evidently  the  action  to  finish  what  Rl 
started.  This  strategy  has  the  effect  of  focusing  the 
system  on  the  results  of  the  actions  most  recently 
taken. " 

This  idea  of  focusing  the  control  system  would  be  beneficial  in  a 
process  control  systems  since  it  improves  the  operating 
efficiency  of  the  system. 

A  problem  can  result  in  the  use  of  this  strategy  for  some 
cases.  Since  Rl  was  chosen  over  R2,  it  is  deemed  that  Rl 
recognizes  a  more  important  problem  than  R2.  In  the  next  cycle 
when  R3  and  R2  appear  in  the  conflict  set,  R3  is  favored  over  R2. 
This  assumes  that  the  condition  identified  by  R3  (considered  to 
be  a  continuance  of  the  correction  begun  in  Rl)  is  more  important 
than  R2.  This  might  not  always  be  true,  depending  on  what 
condition  R2  identifies.  Therefore,  the  selectivity  of  this 
strategy  is  strong,  but  it's  usefulness  can  be  limited  and  would 
be  more  appropriate  when  applied  in  combination  with  some  other 
conflict  resolution  rules.  This  would  help  to  prevent  the 
situation  of  focusing  on  a  problem  of  less  significance,  than 
others  in  the  conflict  set. 

When  the  recency  strategy  is  applied  to  a  knowledge-based 
control  system  that  operates  at  only  one  level  of  reasoning,  then 
no  internal  commands  (posting  of  assertions  to  working  memory) 
will  be  fired  from  the  rules.  The  only  changes  to  working  memory 
will  come  as  the  result  of  changes  in  the  sensory  values  and 
direct  manipulation  of  these  values.  For  these  systems,  the 
calculation  of  the  age  of  the  data  elements  in  working  memory  is 
of  no  use  since  the  contents  of  working  memory  are  updated  at 
each  sample.  The  only  real  indication  of  age  would  be  to  track 
the  time  between  significant  changes  in  each  value..  This  would  be 
a  complex  task. 

In  a  system  employing  multiple  levels  of  reasoning,  some  of 
the  rules  will  make  assertions  to  working  memory  (fire  internal 
commands) .  The  knowledge  base  will  contain  a  mixture  of  rules. 
Some  rules  will  have  preconditions  that  pertain  only  to  sensory 
input.  Other  rules  will  have  preconditions  that  are  a  mixture  of 
sensory  input  and  previously  asserted  commands.  The  question 
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becomes:  how  will  the  system  determine  the  age  of  both  types  of 
rules?  Is  sensory  input  considered  to  be  of  age  zero?  If  so,  then 
rules  composed  of  only  sensory  input  preconditions  will  always  be 
favored.  Because  of  the  confusion,  the  recency  strategy  based  on 
the  age  of  the  data  elements  would  not  be  applicable  to  systems 
which  use  sensory  input  (i.e.,  process  control  systems). 

However,  the  strategy  of  recency  could  be  applied  to  sensory 
based  systems  if  the  determination  of  age  was  based  on  some 
criteria  other  than  the  data  elements.  One  logical  choice  would 
be  to  calculate  the  age  of  an  alternative  based  on  the  amount  of 
time  that  has  transpired  since  it  last  fired.  This  criteria  could 
be  applied  to  rules  with  preconditions  that  contained  either 
sensory  data,  internal  commands,  or  a  mixture  of  the  two. 
Therefore,  the  capability  to  focus  the  system  would  still  exist. 

DISTINCTIVENESS  I  Distinctiveness  rules  ma)ce-up  the  fourth  class 
of  conflict  resolution  rules.  The  selection  criteria  can  be  based 
on  differences  or  similarities  that  exist  between  the 
alternatives  within  the  conflict  set  to  those  previously 
executed.  In  process  control  systems,  operating  in  real-time,  the 
state  of  the  system  will  not  always  change  drastically  between 
samples.  The  amount  of  change  in  the  process  state  recognized  by 
the  knowledge-base  component  will  be  dependent  on  the  application 
(process  time  constants) ,  the  sample  rate,  and  the  disturbances 
affecting  the  process.  If  the  program  identifies  a  problem  and 
makes  a  correction,  there  will  be  a  time  delay  before  the  process 
responds  to  the  correction.  Also  there  will  be  an  additional  time 
delay  before  the  process  can  be  considered  to  have  moved  out  of 
that  problem  state.  Thus,  it  is  likely  that  the  program  will 
continue  to  identify  the  problem  in  subsequent  cycles.  It  would 
not  be  appropriate  to  prescribe  the  corrective  action  again  since 
this  would  lead  to  overcompensation.  This  continual 
overcompensation  could  result  in  the  process  becoming  unstable. 

One  solution  is  to  increase  the  cycle  time  of  the  knowledge- 
based  program.  This  allows  for  a  greater  chance  of  change  between 
examination  of  the  process  states.  However,  if  there  exist  other 
process  conditions  with  smaller  time  constants,  the  system  might 
not  be  able  to  react  quickly  enough  to  provide  proper  control.  A 
better  solution  would  be  to  recognize  that  the  system  fired  the 
same  rule  previously  and  block  it  from  firing  again  until  some 
criteria  is  satisfied.  This  criteria  would  fall  under  the 
specification  of  the  "distinctiveness”  strategy  of  conflict 
resolution. 

For  application  to  continuous  processes,  this  strategy  would 
require  placing  limits  on  how  far  back  in  time  the  system  would 
consider  previous  instantiations.  The  system  would  need  to  purge 
old  information  about  instantiations  to  reduce  the  system  memory 
requirements.  Even  though  this  strategy  is  weakly  selective,  it 
performs  the  necessary  function  of  reducing  the  possibility  of 
overcompensation  in  the  control  of  the  process. 
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ARBITRARY  DECISION;  The  application  of  "arbitrary  decision" 
rules  to  process  control  is  valuable  when  combined  with  other 
strategies.  This  strategy  will  provide  the  necessary  selection 
when  other  strategies  fail.  This  occurs  when  there  remains  two  or 
more  alternatives  in  the  conflict  set  that  show  no  distinction 
from  one  another  after  all  the  other  strategies  have  been 
applied.  It  is  at  this  point  that  a  random  selection  must  be 
made. 

META -RULES ;  The  last  class  of  heuristic  strategies  involves  the 
use  of  meta-rules.  Meta-rules  possess  knowledge  about  the  utility 
of  the  object  level  knowledge  contained  in  the  knowledge  base. 
This  meta-knowledge  is  used  to  provide  a  selection  from  among  the 
alternatives  in  the  conflict  set.  Meta-rules  can  be  advantageous 
to  a  knowledge-based  process  control  system  since  they  provide 
the  opportunity  to  make  a  decision  based  on  process  knowledge. 
This  allows  the  conflict  resolution  system  to  take  into  account 
the  current  process  status  when  making  its  decision  (see  example 
Chapter  2) . 


4.4.2  Numeric  Strategies 

Numeric  strategies  are  the  most  popular  methods  for 
providing  conflict  resolution  in  knowledge-based  system  for 
process  control.  These  methods  are  similar  to  the  priority 
strategy  that  uses  confidence  factors  for  priority  values.  Both 
provide  a  means  to  gain  insight  into  the  validity  of  a  conclusion 
based  on  the  evidence  given.  However,  in  the  numeric  strategies, 
the  belief  values  are  not  static,  but  instead  are  propagated 
through  the  line  of  reasoning.  These  methods  provide  a  means  for 
relating  the  decisions  made  by  the  system,  to  the  physical 
constraints  of  the  process.  For  example,  if  the  reliability  of  a 
sensory  reading  is  low,  then  this  fact  can  be  incorporated  into 
the  decision  making  of  the  process.  No  heuristic  technique 
provides  this  capability,  except,  the  priority  strategy  (which 
could  be  placed  in  the  numeric  category) . 

It  is  for  this  reason  that  numeric  strategies  are  so  popular 
for  knowledge-based  systems  used  in  physical  processing 
applications.  This  method  of  conflict  resolution  incorporates 
reasoning  about  uncertainty  to  allow  the  system  to  indirectly 
deal  with  additional  information  about  the  process  that  is 
provided  in  numeric  form  (individual  confidence  factors) . 


4.4.3  Conclusions 

Table  4.1  lists  the  major  characteristic  and  strength  of 
selectivity  of  each  of  the  conflict  resolution  strategies 
discussed  above.  As  can  be  seen,  no  one  strategy  provides  all  of 
the  necessary  characteristics  that  a  process  control  system 


needs.  It  will  be  necessary  to  construct  a  strategy  which  is  a 
combination  of  these. 


Table  4.1:  Conflict  resolution  strategies  for  knowledge- 

based  process  control  systems. 


Strateav 

Selectivitv 

Characteristic 

Recency 

strong 

Allows  system  to  focus. 

Arbitrary  Decision 

strong 

Makes  definite  selection. 

Production  Order 

strong 

Easy  to  apply  in  small 
systems . 

Confidence  Factors 

moderate 

Provides  understanding  of 
confidence  in  knowledge. 

Numeric  Methods 

moderate 

Attaches  physical  signifi 
cance  to  knowledge. 

Meta-rules 

moderate 

Reasons  about  utility  of 
alternatives. 

Special  Case 

weak 

Eliminates  the  general- 
case  instantiations. 

Distinctiveness 

weak 

Reduces  chance  of  over¬ 
compensation. 

Only  the  production  order,  recency,  and  arbitrary  decision, 
strategies  can  be  used  by  themselves  (although  the  recency 
strategy  is  not  recommended) .  Their  strong  selectivity  provides  a 
means  of  eliminating  all  other  alternatives,  ensuring  that  only  a 
single  alternative  will  remain  for  firing.  These  three  strategies 
can  also  be  used  in  combination  with  the  other  weakly  selective 
strategies.  This  will  ensure  the  selection  of  a  single 
alternative.  The  simplest  methods  to  implement  are  the 
"production  order"  and  "arbitrary  decision"  strategies.  The  other 
strategies  will  require  additional  considerations  in  programming. 

The  strategies  presented  in  this  chapter  represent  the  main 
classes  of  the  techniques  of  conflict  resolution.  Changes  in  the 
criteria  for  a  strategy  class  can  result  in  a  particular 
implementation  that  either  enhances  or  reduces  the  selectivity  of 
the  technique.  This  tailoring  of  the  applied  strategies  for  a 
particular  application  is  desirable  to  optimize  the  efficiency  of 
the  system. 


4 . 5  Example  Systems 

A  number  of  applications  of  knowledge-based  systems  in 
manufacturing  have  been  reported.  These  applications  include 
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monitoring,  diagnosis,  advising,  scheduling,  alarm  handling, 
control,  etc.  In  this  review,  only  those  articles  which  allude  to 
methods  for  conflict  resolution,  within  the  framework  of 
knowledge-based  process  control,  will  be  presented. 

Of  ninety-six  Al-based  process  control  references,  only  five 
address  the  issue  of  conflict  resolution,  and  these  do  not 
provide  much  detail  on  the  methods  used.  These  articles  just 
mention  in  passing  the  need  or  use  of  some  means  to  address  the 
problem  of  conflicting  alternatives.  In  the  discussion  that 
follows,  the  methods  of  these  five  articles  are  presented. 

Burg,  et.al.  [17]  developed  a  real-time  knowledge-based 
process  control  system  which  controls  a  5  Kw  metal  cutting  laser. 
The  system  receives  its  sensory  data  from  an  image  processing 
system  and  provides  control  of  the  laser  power,  the  gas  supply, 
and  the  velocity  of  the  workpiece.  The  system  uses  production 
rules  for  knowledge  representation  and  employs  a  priority  scheme 
for  resolution  of  conflicts.  The  priority  strategy  performs  the 
ordering  of  the  alternatives  based  on  a  weight  function 
associated  with  each  rule.  The  weight  for  each  rule  is  defined 
using  two  parameters:  the  priority  coefficient,  and  the  execution 
time  needed  to  perform  the  action  specified.  The  conflict 
resolution  system  will  order  the  rules  according  to  their 
priority  values  and  then  check  to  see  if  the  best  one  is  able  to 
fire  within  the  time  constraint.  If  not,  then  the  system 
evaluates  the  next  rule  for  firing,  and  so  on.  This  scheme  allows 
the  system  to  operate  within  the  time  constraints  of  the  process, 
but  it  does  not  guarantee  that  the  constraint  will  be  satisfied. 
Flexibility  is  provided  by  allowing  the  system  to  modify  the 
parameter  values  during  system  operation.  This  scheme  is  not  a 
numerical  method  because  the  weights  are  used  directly  and  not 
propagated  through  the  reasoning  process.  This  approach  is  a 
priority-based  heuristic  technique  that  belongs  to  the 
"production  order"  class  of  conflict  resolution  strategies.  The 
use  of  the  execution  time  provides  further  selection,  by 
eliminating  those  alternatives  which  don't  satisfy  a  current 
process  constraint. 

Gallanti,  et.al.  [18],  presents  a  new  architecture  which 
combines  declarative  knowledge,  in  the  form  of  production  rules, 
with  procedural  knowledge,  in  the  form  of  event-graphs  (also 
referred  to  as  directed-graphs) .  The  only  mention  of  conflict 
resolution  strategies  is  in  reference  to  a  "credibility  value," 
which  is  defined  for  each  active  diagnosis/intervention  event 
graph.  The  credibility  value  represents  the  probability  that  the 
event-graph  is  the  most  appropriate  current  means  for  explaining 
the  ongoing  phenomenon  and  for  suggesting  the  right  actions. 
Maintenance  of  the  credibility  values  is  performed  at  the  control 
level,  which  embodies  the  declarative  knowledge.  No  mention  of 
the  specific  strategy  for  the  use  of  these  values  is  given.  It  is 
assumed  that  a  priority  ordering  is  the  most  likely  method  of 
employing  these  values  to  order  the  alternative  diagnoses  and  to 
make  a  selection. 
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Moore  and  Hawkinson  [19]  present  some  of  the  details  of  the 
PICON  system  for  process  control.  PICON  is  a  knowledge-based 
system  that  runs  on  a  LISP-class  machine  and  is  capable  of 
providing  control  for  processes  involving  large  numbers  of 
process  variables.  PICON  uses  both  deep  knowledge  and  heuristic 
knowledge  with  a  frame-based  representation.  Within  a  frame,  a 
certainty  factor  is  included  as  a  dynamic  attribute.  No  further 
mention  of  the  manipulation  of  these  factors  is  given.  It 
is  possible  that  the  certainty  factors  are  used  in  some  type  of 
numeric  or  priority  based  strategy  of  conflict  resolution. 

In  an  article  by  Rao,  Jiang,  and  Tsai  [20],  the  authors 
present  a  new  architecture  for  real-time  intelligent  control 
systems.  The  design  of  this  architecture  is  based  on  the  belief 
that  the  coordination  of  symbolic  reasoning  and  numeric 
computation  is  essential  to  developing  an  intelligent  process 
control  system.  This  design  couples  several  symbolic  reasoning 
systems  and  numerical  computation  routines,  and  uses  a  meta 
system  to  control  the  selection,  operation,  and  communication  of 
these  programs.  The  meta-system  performs  the  task  of  finding  the 
optimal  solution  when  presented  with  conflicting  facts  and  events 
from  different  expert  systems.  Again,  no  further  information  is 
provided  about  how  the  system  handles  conflicts,  only  the 
statement  that  the  use  of  the  certainty  factors  is  "to  aid  in 
representing  the  imprecise  knowledge  and  indicate  the  reliability 
of  decision-making"  [20]. 

HEXSCON,  a  Hybrid  Expert  System  Controller,  is  a  knowledge- 
based  expert  system  intended  to  deal  with  control  problem.s  [21]. 
When  multiple  lines  of  reasoning  arrive  at  conflicting 
alternatives,  HEXSCON  relies  on  a  numerical  based  technique  for 
conflict  resolution  based  on  the  techniques  of  Dempster  and 
Shafer. 

A  note  worth  making  is  that  in  these  five  references,  each 
implementation  makes  use  of  either  a  numeric  based  technique  or 
some  form  of  a  priority  based  method  employing  confidence 
factors.  In  the  next  section,  a  reason  for  this  selection  will  be 
proposed. 


4 . 6  Conflict  Resolution  in  Knowledae-based  Process  Control 

After  examination  of  the  literature  dealing  with  knowledge- 
based  process  control  systems,  it  appears  that  the  favored  method 
for  conflict  resolution  is  the  use  confidence  factors.  These 
confidence  factors  are  employed  in  one  of  two  ways.  They  can  be 
used  as  static  values  which  indicate  the  probability  of  a  single 
deduction.  A  second  possibility  is  to  use  them  as  dynamic  values 
which  indicate  a  degree  of  belief  that  will  contribute  to 
calculating  the  plausibility  for  an  entire  line  of  reasoning.  As 
mentioned  earlier,  these  methods  allow  the  knowledge  engineer  to 
attach  a  degree  of  belief  to  the  knowledge  elements  (rules) .  This 
provides  a  means  for  representing  the  actual  characteristics  of  a 
particular  process.  In  a  typical  process  control  system,  not  all 
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of  the  interfacing  hardware  (sensors,  meters,  etc.)  has  the  same 
reliability  or  accuracy.  Therefore,  the  evidence  based  on  the 
data  provided  by  sensors  should  not  be  weighted  equally.  In 
addition,  it  is  possible  that  some  rules  within  the  system  have 
not  been  proven,  and  it  would  be  beneficial  to  be  able  to  attach 
some  degree  of  belief  to  the  conclusions  that  they  provide. 

For  a  system  using  a  single  level  of  reasoning,  it  is  not 
possible  to  implement  dynamic  confidence  values.  This  is  because 
the  conclusion  fired  represents  an  external  command  that  was 
directly  deduced  from  the  sensory  data  (or  manipulations  of  this 
data)  .  The  conclusion  reached  is  not  dependent  on  some  line  of 
reasoning  based  on  evidence  obtained  from  some  other  rules. 
Therefore,  for  these  single  level  reasoning  systems,  there  is  no 
need  for  consideration  of  dynamic  confidence  values. 

The  heuristic  methods  are  still  valid  methods  of  conflict 
resolution.  In  fact,  even  the  two  methods  above  will  require  that 
a  heuristic  method  (such  as  arbitrary  decision,  production  order) 
be  included  to  handle  the  possible  case  of  a  tie  in  the  selection 
of  an  alternative.  Any  system  that  does  not  employ  a  strategy 
based  on  confidence  factors  must  use  heuristic  strategies.  It  is 
these  heuristic  methods  that  are  not  mentioned  in  the  literature 
pertaining  to  process  control  systems.  Maybe  the  authors  of  those 
articles  think  it  a  trivial  matter  and  not  worth  mentioning.  The 
selection  and  combination  of  the  different  methods  of  conflict 
resolution,  will  be  dependent  on  the  desired  characteristics  of 
the  overall  system. 


CHAPTER  5 


QUALITATIVE  PROCESS  AUTOMATION  (QPA) 


5 . 1  Introduction 

Qualitative  process  automation  (QPA)  is  defined  as 
qualitative  physics  applied  to  process  control  [22].  The  idea  of 
qualitative  physics  is; 

•'Qualitative  physics  predicts  and  explains  the  behavior 
of  mechanisms  in  qualitative  terms,  i.e.,  the  goals  for 
the  qualitative  physics  are  to  be  far  simpler  than  the 
classical  physics,  yet  retain  all  the  important 
distinctions  such  as  state,  oscillation,  gain,  and 
momentum  without  invoking  the  mathematics  of 
continuously  varying  quantities  -  differential 
equations,  and  produce  causal  accounts  of  physical 
mechanisms  that  are  easy  to  understand."  [23] 

The  use  of  qualitative  techniques  to  model  the  process,  as 
opposed  to  quantitative  techniques,  allows  for  the  use  of  process 
knowledge  within  the  framework  of  an  Al-based  program. 

In  this  chapter  the  theory  of  qualitative  process  automation 
(QPA)  is  presented.  The  application  of  this  theory  is  discussed 
in  reference  to  autoclave-curing  of  graphite-epoxy  composite 
materials.  The  presentation  includes  a  discussion  of  the 
operation  of  the  QPA  controller,  and  a  detailed  discussion  of  the 
method  of  conflict  resolution  used  by  the  system. 


5.2  QPA  as  a  Knowledge-Based  Controller 

QPA  represents  one  example  implementation  of  the  theory  of  a 
knowledge-based  process  control  (KBPC)  system.  QPA  differs  from 
most  other  systems  in  that  it  uses  qualitative  physics  to 
describe  the  mechanics  of  the  process  under  control .  The 
functions  of  the  QPA  controller  are  to  identify  the  process  state 
and  specify  the  set-point  values  for  each  process  variable.  Thus 
the  QPA  controller  operates  in  a  supervisory  mode  as  a  set-point 
scheduler.  The  ideal  QPA  controller  can  analyze  trends  in  the 
process  from  knowledge  of  the  process  status,  and  can  predict 
possible  outcomes  of  the  control  actions  using  knowledge  of  the 
process  behavior.  The  controller  obtains  knowledge  of  the  process 
status  from  the  sensory  data,  and  from  the  temporal  and  spatial 
derivatives  that  are  derived  from  manipulation  of  the  data.  The 
process  behavior  is  determined  by  interpreting  the  sensory 
information.  For  example,  if  the  first  derivative  of  the  laminate 
temperature  is  positive,  the  laminate  temperature  is  increasing. 

The  basis  of  QPA's  operation  is  to  adapt  the  process  (e.g., 
autoclave,  rector,  robot,  machine  tool,  etc.)  to  meet  a  specified 
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goal.  To  provide  for  temporal  reasoning,  a  process  is  broken  down 
into  a  series  of  episodes.  For  each  episode  a  goal  is  specified. 
It  is  the  job  of  the  QPA  controller  to  drive  the  process  to  meet 
that  goal  by  manipulating  the  controller  set-points.  A  more 
detailed  example  will  be  presented  in  a  later  section. 


5.3  Autoclave  Lamination 

The  QPA  controller  was  first  applied  to  the  curing  of 
graphite-epoxy  composites  in  an  autoclave. 

"The  particular  process  was  selected  not  so  much 
because  it  is  beyond  the  state-of-the-art  in  adaptive 
control  technology  but  because  it  is  on  the  leading 
edge  of  conventional  model-based  process  control 
technology  .  .  .."[24] 

The  current  method  of  curing  composites  relies  on  predetermined 
patterns  (set-point  schedules)  that  describe  the  variation  of 
temperature  and  pressure  during  the  process  cycle.  This  method 
results  in  long  processing  tiroes  (cure  cycles) ,  variations  in 
product  quality,  and  poor  yields.  Employing  the  QPA  controller 
eliminates  the  need  for  explicitly  specifying  these  patterns. 
Instead,  the  QPA  controller  will  monitor  the  process  and  use  its 
knowledge  of  the  process  to  automatically  control  the  cure  cycle. 

The  QPA  controller  receives  sensoiry  data  from  the  autoclave. 
This  data  includes  the  temperature  and  pressure  of  the  autoclave, 
the  temperature  of  the  top  and  middle  layers  of  the  composite, 
and  a  dielectric  reading  of  the  composite  (used  as  an  indication 
of  the  viscosity  of  the  material).  This  sensory  data  is  used  by 
the  system  to  determine  the  status  of  the  process  and  to 
determine  its  behavior.  The  application  of  the  QPA  controller  to 
the  autoclave  process  has  significantly  reduced  processing  times 
for  the  curing  of  the  composites  [25] [26]. 


5.4  Method  of  Operation 

A  block  diagram  of  the  QPA  system  is  shown  in  Figure  5.1. 
From  its  supervisory  position,  the  QPA  controller  drives  the  set- 
points  of  two  conventional  proportional-integral-derivative  (PID) 
controllers.  One  controls  the  autoclave's  temperature  and  the 
other  its  pressure.  The  QPA  controller  is  composed  of  three 
expert  system  subsystems:  the  blackboard  monitor,  the  Parser,  and 
the  Thinker  (see  Figure  5.2).  A  blacJcboard  architecture  is  used 
to  allow  for  communication  between  each  of  the  subsystems.  The 
blackboard  monitor  coordinates  program  control  and  provides  an 
external  interface  to  the  process  (sensory  data  and  controllers) . 
From  the  sensory  data,  the  blackboard  monitor  derives  information 
about  the  behavior  of  the  process.  This  information  is  posted  to 
the  blackboard  along  with  the  raw  data  from  the  sensors.  The 
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Figure  5. it  Block  diagram  of  a  knowledge-based  process 
control  system  based  on  QPA  theory. 


second  subsystem,  the  Parser,  then  uses  this  data  to  identify  the 
process  status.  The  Parser  posts  the  results  to  the  blackboard 
for  use  by  the  Thinker.  The  Thinker  subsystem  then  determines  the 
required  commands  for  maintaining  control  of  the  process  and  for 
achieving  the  goal  of  the  system.  These  commands  are  posted  to 
'  the  blackboard  from  where  they  will  be  sent  out. 

The  knowledge  within  the  subsystems  is  represented  using 
frame-based  rules.  QPA  defines  two  types  of  frames:  process 
instance  frames  and  episode  frames.  The  frame  used  to  represent 
each  process  instance  is  defined  as: 

FRAME:  rule-name 
ISA  type 

PRECONDITIONS  rule-antecedents 
EFFECTS  rule-consequents 

ACTORS  nil  L; 
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Process  Expert  Systerms  Enviror^ment 


Figure  5.2:  Block  diagram  of  the  QPA  controller  [24]. 


A  process  instance  frame  can  represent  a  desirable  process 
condition,  an  undesirable  process  condition  (process  problem) ,  or 
a  process  condition  denoting  a  transition  between  process 
episodes.  In  the  current  implementation,  the  ISA  and  ACTORS  slots 
have  no  functional  meaning.  The  PRECONDITIONS  slot  contains  a 
list  of  process-related  conditions  that  must  be  satisfied  before 
the  system  (Parser)  declares  the  frame  as  active  (instantiated, 
true).  The  EFFECTS  slot  contains  the  consequents,  or  actions,  for 
the  rule  specified  by  the  process  instance.  These  actions 
represent  the  control  commands  prescribed  for  dealing  with  the 

process  state  defined  by  the  frame. 

The  episode  frame  is  used  to  define  each  step  or  phase  of 
the  process.  The  frame  is  defined  as: 

FRAME:  episode-name 

SYMBOL  episode-symbol 

TRANSITION  episode-goal (s) 

AGENDA  process-conditions  L; 
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The  episode  SYMBOL  slot  is  used  by  the  system  to  label  the 
currently  active  episode.  The  TRANSITION  slot  specifies  which 
process  instance  frame  dictates  the  behavior  of  the  episode.  This 
process  instance  frame  that  specifies  the  process  conditions  that 
identify  when  the  episode  is  active  and  the  goal,  or  goals,  to  be 
achieved  during  the  episode.  The  AGENDA  slot  provides  a  list  of 
the  process  instance  frames  that  the  system  should  examine  during 
the  episode.  This  list  of  specific  process  instances  allows  QPA 
to  focus  the  system  on  a  certain  portion  of  the  knowledge  base, 
improving  the  efficiency  of  its  operation.  This  is  similar  to  the 
concept  of  "goal  directedness"  discussed  by  Sauers  and  Walsh  [6]. 

In  summary,  the  Blackboard  monitor  receives  data  from  the 
external  sources  and  derives  behavioral  information  about  the 
process.  The  data  and  information  are  posted  to  the  blackboard 
and  passed  to  the  Parser.  The  Parser  then  determines  the  process 
status  by  identifying  which  process  instance  frames  are  active. 
These  frames  are  labeled  as  active  on  the  blackboard  and  the 
blackboard  is  passed  to  the  Thinker.  The  Thinker  then  identifies 
the  active  episode  and  constructs  a  conflict  set  from  the  list  of 
effects  of  each  active  frame.  The  Thinker  then  resolves  the 
conflict,  posts  its  decision  on  the  blackboard,  and  passes  it 
back  to  the  Blackboard  monitor.  The  Blackboard  monitor  will  then 
execute  the  commands. 


5.5  Conflict  Resolution  in  QPA 
5.5.1  Method  of  Conflict  Resolution 


As  with  other  knowledge-based  systems,  QPA  requires  a  means 
of  conflict  resolution  when  multiple  rules  (frames)  have  been 
instantiated  and  are  applicable  for  firing.  When  a  frame  is 
declared  as  active,  QPA  places  the  effects  (actions)  of  that  rule 
in  a  list  called  PLANS,  which  represents  a  type  of  conflict  set. 
(In  actuality,  the  list  PLANS  contains  the  addresses  of  effect 
found  to  be  active.)  This  list  is  then  consulted  to  determine 
which  effect  to  implement. 

The  effects  prescribed  by  a  frame  are  statements  that  will 
prescribe  some  change  in  a  condition  of  the  process.  Examples  of 
these  effect  statements  include: 


(  laminate-mid-teroperature 
(  degree-of-cure 
(  autoclave-temperature 
(  pressure 
(  heater 
(  heater 


increase  ) 
increase  ) 
decrease  ) 
increase  ) 
hold  ) 
decrease  ) 


Notice  that  some  of  these  prescribe  changes  in  states  that  are 
not  directly  controllable  (e.g.,  degree-of-cure,  laminate-mid¬ 
temperature)  ;  whereas,  some  prescribe  changes  involving  the 
direct  manipulation  of  the  final  control  elements  (heater. 
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pressure) .  Each  of  the  effects  is  passed  through  a  routine  which 
determines  which  manipulative  variable  relates  to  that  change  in 
state  and  how  that  change  in  state  is  influenced  by  the 
manipulative  variable.  This  information  is  obtained  from 
"influence”  statements.  Examples  of  these  statements  include: 

I+(  laminate-mid-temperature  degree-of-cure  ) 

I+(  laminate-top-temperature  laminate-mid-temperature  ) 

I+(  autoclave-temperature  laminate-top-temperature  ) 

I+(  heater  autoclave-temperature  ) 

Although  none  are  shown,  it  is  also  possible  to  have  negative 
influences  (I-)  .  In  addition,  the  system  will  use  the 
CONTROLLABLE  Statement  to  specify  which  process  variables  can  be 
directly  controlled.  Examples  of  these  statements  are: 

CONTROLLABLE (  heater  messagel  ) 

CONTROLLABLE (  pressure  messages  ) 

The  "messagel"  and  "messages"  represent  the  particular  blackboard 
slot  associated  with  that  controllable  variable. 

The  influence  statements  are  structured  to  provide 
information  on  how  one  process  variable  is  affected  by  another. 
For  example,  assume  that  the  action  require  increasing  the 
DEGREE-OF-CURE.  The  system  would  scan  the  influence  statements 
and  determine,  from  the  first  statement  above,  that  the  LAMINATE- 
MID-TEMPERATURE  positively  influences  DEGREE-OF-CURE.  This  tells 
QPA  that  if  it  wants  to  increase  the  degree  of  cure  it  must  raise 
the  temperature  in  the  middle  of  the  laminate.  The  system  can't 
identify  LAMINATE-MID-TEMPERATURE  as  a  controllable  variable,  so 
it  must  address  the  influence  statements  again.  The  next 
influence  statement  reveals  that  LAMINATE-MID-TEMPERATURE  can  be 
increased  by  increasing  the  LAMINATE-TOP-TEMPERATURE.  Again,  this 
is  not  a  controllable  variable,  and  the  system  goes  back  to  the 
influence  statements.  This  action  continues  until  the  system 
reaches  the  statement  identifying  the  HEATER  as  the  variable  that 
can  positively  influence  the  AUTOCLAVE-TEMPERATURE.  Since  HEATER 
is  a  controllable  variable,  the  system  appends  the  command  to 
"increase"  the  HEATER  (turn  it  up)  to  a  list  called  VOTES.  This 
external  command  is  the  EFFECT  that  will  produce  the  desired 
result  of  increasing  the  DEGREE-OF-CURE. 

The  VOTES  list,  like  the  PLANS  list  mentioned  earlier,  is 
also  a  conflict  set,  except  that  VOTES  contains  the  effects  in 
terms  of  controllable  variables.  If  the  effect  given  is  neither 
found  on  the  positive  or  negative  influence  list,  then  the  effect 
itself  is  directly  appended  to  the  list  VOTES.  This  type  of 
effect  would  represent  a  system  characteristic  which  is  not 
influenced  by  other  system  dynamics.  It  would  be  directly 
controlled  without  outside  influence. 

At  this  point  in  the  execution  of  the  program,  the  conflict 
set  contains  all  the  process  instance  effects  that  are  applicable 
to  the  current  condition  of  the  process.  The  Thinker  must  now 
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decide  which  effect  to  implement.  The  strategy  used  to  resolve 
the  conflict  involves  assigning  priorities  to  the  type  of  action 
performed  by  the  effect.  In  QPA,  these  priorities  are  specified 
apriori.  Priorities  are  determined  for  the  final  control  element 
associated  with  each  manipulated  variable  and  for  each  possible 
action  that  can  be  implemented  for  that  control  element.  These 
priorities  are  specified  by  the  use  of  a  RESOLVES  statement 
located  in  the  knowledge  base.  As  an  example,  consider  the  group 
of  RESOLVES  statements: 

RESOLVES (  heater  decrease  increase  ) 

RESOLVES (  heater  decrease  hold  ) 

RESOLVES (  heater  hold  increase  ) 

RESOLVES (  pressure  increase  decrease  } 

Listed  are  all  the  possible  combinations  of  effects  that  can  be 
implemented  for  each  final  control  element.  The  terms  HEATER  and 
PRESSURE  represent  the  final  control  elements  for  the  autoclave. 
The  RESOLVES  statement  structure  is  such  that  the  first  argument 
represents  the  control  element  (controllable)  and  the  next  two 
arguments  represent  possible  actions  to  be  performed  by  that 
control  element.  The  RESOLVES  statement  is  constructed  such  that 
the  first  action  argument  is  the  one  to  favor  in  a  conflict.  For 
example  consider  the  situation  where  a  conflict  exists  in  the 
control  action  prescribed  for  the  heater.  If  the  prescribed 
actions  are  both  "decrease”  and  "increase,”  then  from  the 
RESOLVES  statement  it  is  known  that  whenever  a  choice  between 
"decrease”  and  "increase"  exists,  it  is  best  to  select  the 
"decrease"  action. 

The  RESOLVES  statements  provide  the  knowledge  used  to 
resolve  the  conflicts  within  the  system.  Even  though  this 
knowledge  exists  in  the  knowledge  base,  it  is  the  control  program 
within  the  Thinker  that  decides  how  to  apply  this  knowledge 
(favor  the  first  of  the  two  arguments  in  the  statement)  to 
resolve  conflicts.  This  strategy  falls  under  the  "production 
order"  class  of  heuristic  strategies.  Resolution  of  conflicts  is 
dependent  on  the  placement  of  frames  within  the  knowledge  base, 
and  on  preassigned  priorities  for  alternatives.  The  only 
requirement  for  placement  is  that  the  process  instance  frames 
associated  with  the  episodes  be  placed  in  front  all  other  frames. 
The  remaining  process  instance  frames  require  no  specific 
ordering.  The  assignment  of  priorities  is  similar  to  the  method 
outlined  in  Chapter  2.  The  main  difference  is  that  instead  of 
assigning  priorities  for  all  the  rules  in  the  system  using  some 
means  such  as  confidence  factors,  the  system  only  assigns 
priorities  to  the  possible  actions  that  a  rule  will  prescribe.  In 
this  way,  the  system  does  not  relate  a  specific  action  (in  the 
conflict  set)  to  the  precondition (s)  that  prescribed  that  action. 
This  simplifies  the  process  of  assigning  priorities,  but  results 
in  a  loss  of  physical  significance  due  to  the  generalization  of 
the  effects.  This  generality  is  not  a  problem  unless  the  evidence 
(which  make-up  the  preconditions)  varies  in  certainty,  or  its 
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application  is  not  consistent.  For  example,  assume  a  system  has 
decided  that  due  to  the  evidence  presented,  it  is  12%  sure  that  a 
decrease  in  heat  is  needed,  and  92%  sure  that  an  increase  is 
needed.  The  RESOLVES  statement  would  state  that  a  decrease  in 
heat  is  appropriate,  but  a  process  operator  might  decide  to  favor 
an  increase  for  another  sample  or  two.  For  these  types  of  systems 
with  varying  degrees  of  confidence  in  the  evidence,  it  would  be 
necessary  to  use  a  numeric  method  to  resolve  the  conflicts  and  to 
support  uncertainty  in  the  reasoning. 


5.5.2  Types  of  Conflicts 

In  QPA,  the  alternatives  that  make-up  the  conflict  set 
represent  only  external  commands  with  the  single  purpose  of 
driving  the  set-points  of  the  process.  However,  the  format  of  the 
conflicts  that  occur  are  different  from  what  one  would  find  in  a 
typical  knowledge-based  system.  This  difference  is  due  to  the 
structure  of  the  QPA  system  and  its  use  of  episodes.  First, 
recall  that  under  QPA  control  a  process  is  represented  as  a 
series  of  episodes,  where  each  episode  has  a  process  goal  that 
the  controller  is  trying  to  achieve.  The  episode  goal  is 
described  as  an  effect  (controller  action)  that  is  an  external 
command  to  the  controller.  For  example,  in  the  "cure"  episode  of 
the  lamination  process,  the  goal  is  to  increase  the  degree  of 
cure  of  the  laminate.  Using  knowledge  of  the  process  influences, 
this  episode  goal  translates  into  a  command  to  turn-up  (increase) 
the  heater.  Therefore,  throughout  the  ”cure''  episode,  the 
controller  will  increase  the  temperature  in  the  autoclave.  The 
effect  of  this  goal  will  continue  to  be  applied  until  that 
episode  ends,  or  a  process  problem  occurs  that  requires  a 
different  controller  action. 

The  types  of  conflicts  that  can  occur  in  the  QPA  system  can 
be  described  as: 

1.  Episode/Single  problem  -  conflicts  between  the  episode 
goal  and  a  process  problem  that  occurs  during  that  episode, 
or, 

2.  Episode/Multiple-problems  -  conflicts  between  the  episode 
goal  and  two  or  more  process  problems  occurring 
simultaneously  during  that  episode. 

The  main  type  of  conflict  that  the  occurs  in  the  system  is 
the  ''episode/single  problem"  type.  This  type  of  conflict  results 
from  the  occurrence  of  a  process  instance  that  requires  a  control 
action  which  opposes  the  action  prescribed  by  the  episode  goal . 
For  example,  assume  the  process  is  in  the  "cure"  episode  where 
the  current  control  action  (episode  goal)  requires  that  the 
heater  be  turned  up.  Then,  during  this  episode,  the  knowledge- 
based  system  detects  an  accelerated  reaction.  The  prescribed 
effect  of  the  process  instance  frame  identifying  this  problem  is 
to  decrease  the  heat.  Therefore,  the  conflict  set  will  contain 
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the  two  conflicting  actions  that  the  heater  be  increased  or 
decreased.  Using  the  RESOLVES  statement,  the  system  would  decide 
to  decrease  the  heat. 

The  "episode/single  problem"  conflict  is  a  trivial  conflict 
when  one  considers  the  logic  used  to  control  a  process.  During 
the  control  of  any  process,  the  purpose  is  to  maintain  control  of 
the  process  in  the  face  of  disturbances.  When  a  problem  occurs 
then  it  is  necessary  to  give  it  immediate  attention,  unless  there 
is  another  problem  occurring  that  has  a  higher  priority. 
Therefore,  any  time  the  system  detects  only  a  single  problem, 
there  is  really  no  need  to  consider  it  a  conflict,  the  control 
action  prescribed  by  the  process  problem  should  always  be  given 
priority  attention. 

The  second  type  of  conflict,  "episode/multi-problem,"  can  be 
further  broken  down  into  cases  of  conflicts: 

Case-1:  The  controller  actions  prescribed  by  the  episode 
goal,  and  two  or  more  process  problems  are  in  conflict  one 
with  another. 

Case-2:  The  controller  actions  for  the  two  or  more  process 
problems  are  in  conflict  with  each  other,  but  not  with  the 
episode  goal  because  they  deal  with  a  different  manipulative 
variable. 

The  Case-1  conflict  is  similar  to  the  "episode/single 
problem"  type  of  conflict  discussed  above.  The  difference  is  that 
the  system  has  detected  two  process  problems,  instead  of  just 
one.  Just  as  before,  the  process  problems  have  a  higher  priority 
than  the  episode  goal.  Therefore,  the  job  of  the  conflict 
resolution  strategy  is  to  resolve  the  conflicts  between  the 
control  actions  prescribed  by  the  process  problems. 

The  Case-2  conflict  is  identical  to  Case-1,  except  that  now 
the  episode  goal  addresses  a  different  manipulative  variable  than 
the  process  problems.  For  this  situation,  two  control  actions 
will  be  implemented,  one  from  the  episode  goal  (addressing  one 
manipulative  variable) ,  and  one  which  is  the  result  of  the 
resolution  of  the  conflict  between  the  process  problems 
(addressing  a  different  manipulative  variable) . 

In  addition,  other  combinations  of  conflicts  are  possible. 
However,  the  logic  used  to  address  these  conflicts  would  be 
similar  to  that  discussed  above. 


5.5.3  Distinctions  of  Conflict  Resolution  in  OPA 

The  current  implementation  of  the  QPA  theory  is  in  its 
infancy.  As  was  shown  in  the  previous  section,  the  QPA  program 
employs  a  heuristic  strategy  for  conflict  resolution  that 
provides  complete  selectivity,  ensuring  that  all  alternatives  are 
distinguished  from  each  other  in  terms  of  their  worth.  However, 
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there  are  several  factors  concerning  the  conflict  resolution 
system  that  require  further  explanation. 

As  discussed  in  Chapter  4,  overcompensation  in  the 
assignment  of  control  actions  might  occur  unless  some  method  is 
provided  to  block  alternatives  that  were  previously  fired.  The 
current  implementation  of  QPA  does  not  require  the  blocking  of 
previously  fired  alternatives.  This  is  due  to  the  structure  of 
the  conflict  set  in  the  QPA  system.  In  each  system  cycle  (Parse- 
Think)  the  conflict  set  will  contain  an  alternative  that  is 
placed  there  by  the  episode  goal  EFFECT.  Also,  in  each  cycle  QPA 
will  send  the  PID  controller  a  set-point  scheduling  command.  When 
a  process  problem  is  detected,  the  EFFECT  of  the  process  instance 
frame  associated  with  this  problem  will  be  added  to  the  conflict 
set,  and  this  EFFECT  will  have  a  higher  priority  than  that  of  the 
episode  goal.  Therefore,  the  EFFECT  of  the  process  problem  will 
fire  and  the  controller  will  apply  the  correction  to  alleviate 
the  problem.  If  this  process  problem  alternative  is  blocked  on 
the  next  cycle  of  the  system,  then  unless  another  process  problem 
occurs,  the  episode  goal  will  be  the  only  alternative  in  the 
conflict  set.  If  the  EFFECT  of  the  episode  goal  is  associated 
with  the  same  manipulative  variable  as  the  EFFECT  that  is 
blocked,  the  control  action  of  the  episode  EFFECT  will  be  in 
direct  opposition  to  the  EFFECT  prescribed  in  the  previous  cycle. 
This  results  in  canceling  the  overall  effect  of  the  previous 
command  (from  the  process  problem),  allowing  it  only  one  cycle  to 
act  in  alleviating  the  process  problem. 

QPA  overcomes  this  problem  by  not  blocking  alternatives. 
This  absence  of  blocking  will  allow  the  alternative  EFFECT  posted 
by  the  process  problem  to  remain  a  member  of  the  conflict  set 
until  the  system  no  longer  recognizes  the  process  problem.  This 
results  in  the  process  problem  alternative  continually  being 
selected  over  the  episode  goal  (in  essence  blocking  the  episode 
goal) ,  unless  another  process  problem  occurs  that  has  even  higher 
priority.  In  order  for  this  method  to  work,  it  is  necessary  that 
the  set-point  be  moved  in  small  increments  which  are 
approximately  equal  to  the  time  constant  of  the  process  that  is 
associated  with  that  variable.  If  the  set-point  is  moved  at  a 
rate  faster  than  the  process  can  respond,  then  there  will  be  some 
amount  of  overcompensation. 

Consider  the  composite  lamination  process  as  an  example. 
When  the  process  is  operating  in  the  "cure"  episode,  the  goal  is 
to  increase  the  degree  of  cure  of  the  laminate.  Using  the  list  of 
"influences,”  this  translates  into  a  need  to  turn-up  the  heater. 
So  at  each  Parse-Think  cycle  of  the  system  the  conflict  set  will 
contain  the  alternative  to  turn-up  the  heater.  Assume  that  the 
system  detects  the  process  problem  that  an  accelerated  reaction 
is  occurring  in  the  laminate.  The  effect  of  this  process  problem 
is  to  turn-down  the  heater  (the  process  problem  and  episode 
effect  both  pertain  to  the  same  manipulative  variable  -  heater) . 
The  resultant  conflict  set  will  contain  the  two  alternatives  to 
turn-up  and  turn-down  the  heater.  The  alternative  to  turn-down 
the  heater  is  selected.  As  long  as  the  process  problem  is 
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detected,  both  of  these  alternatives  will  be  present  in  the 
conflict  set,  and  the  action  of  turning  down  the  heater  will 
always  be  selected.  If  the  set -point  is  moved  ten  degrees  at  each 
sample,  where  a  sample  is  made  every  30  seconds,  then  the  overall 
driving  of  the  process  is  20  degrees/minute.  If  the  process  can 
actually  only  heat-up  at  a  rate  of  10  degrees/minute  (in  the 
operating  range) ,  the  controller  will  get  ahead  of  the  process  in 
its  prescription  of  control.  Then,  when  the  process  problem  is 
alleviated,  there  is  a  good  chance  that  the  set-point  value  will 
be  much  less  than  the  current  process  temperature.  So,  when  QPA 
begins  to  prescribe  an  increase  in  the  set-point  value,  there 
will  be  a  delay  until  the  value  reaches  the  current  level  of  the 
autoclave  temperature.  This  effect  can  be  lessened  by  setting 
maximum  and  minimum  values  for  the  set-point  and  more  closely 
matching  the  movement  of  the  set-point  to  the  response  time  of 
the  process. 

The  working  memory  of  QPA  is  based  on  a  blackboard  structure 
composed  an  array  of  storage  slots.  Each  of  these  slots  are 
updated  during  each  parse-think  cycle  of  the  system.  Because  of 
this  updating  scheme  and  the  fact  that  QPA  uses  only  one  level  of 
reasoning,  there  is  no  need  for  non-monotonic  reasoning.  There  is 
no  need  to  retract  facts  from  working  memory  (blackboard)  since 
they  are  updated  during  each  cycle  of  the  system. 

Another  feature  of  QPA  is  that  it  can  be  applied  to 
multivariable  control  systems.  In  these  systems,  QPA  is 
responsible  for  scheduling  the  set-points  of  two  or  more  process 
controllers.  During  each  program  cycle  QPA  is  unique  in  that  it 
has  the  capability  to  select  an  EFFECT  for  each  process  set- 
point.  In  other  knowledge-based  programs  the  resolution  process 
is  restricted  to  making  only  a  single  selection  from  among  the 
conflict  set. 


5 . 6  Other  Methods  of  Conflict  Resolution 

The  current  implementation  of  QPA  is  limited  in  its 
application.  One  reason  is  because  of  the  method  it  uses  to 
resolve  conflicts.  The  resolution  method  is  applicable  only  to 
systems  where  the  set-points  can  be  moved  in  relative  increments 
based  on  qualitative  commands  (increase,  decrease,  hold) .  If 
other  methods  of  resolution  were  employed,  this  limitation  could 
be  removed. 

The  first  addition  to  the  resolution  toolbox  would  be  a 
numerical  method  of  conflict  resolution.  The  capability  to  reason 
about  uncertainty  and  use  this  information  to  resolve  conflicts 
would  be  beneficial  in  QPA.  Instead  of  assuming  that  all  the 
evidence  (sensory  input)  provided  by  the  process  is  equivalent, 
the  system  could  take  into  account  the  peculiarities,  and  faults, 
of  each  piece  of  evidence.  The  resultant  belief  values  obtained 
for  each  instantiated  rule  could  be  used  directly  for  ordering 
the  alternatives  within  a  conflict  set.  This  method,  when  coupled 
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with  one  or  more  heuristic  strategies,  could  be  used  to  provide 
an  a  robust  conflict  resolution  strategy  for  process  control. 

It  is  important  that  the  numeric  methods  be  applied  in 
combination  with  one  or  more  heuristic  techniques  to  ensure  that 
the  system  selects  only  one  alternative  for  implementation.  The 
system  must  distinguish  between  each  alternative  within  the 
conflict  set  in  case  several  alternatives  result  in  the  same 
belief  values.  This  can  be  accomplished  with  a  heuristic  strategy 
such  as  "arbitrary  decision"  or  "production  order." 

Currently,  only  one  heuristic  strategy  is  employed  in  QPA. 
The  implementation  of  this  strategy  is  provided  in  FORTH  code  and 
is  not  easy  to  decipher.  The  best  alternative  would  be  to  offer 
several  strategies  to  the  user  and  allow  the  user  the  freedom  to 
select  which  ones  to  implement  and  the  order  in  which  they  are  to 
be  applied  to  resolve  conflicts. 


5.7  Conclusions 

The  application  of  QPA  theory  has  resulted  in  a  system  that 
possesses  great  potential.  The  QPA  controller  has  worked  well 
with  the  batch  process  of  laminating  composite  materials  in  an 
autoclave.  The  knowledge-based  program  is  capable  of  operating  in 
real-time  to  identify  process  behavior  and  to  provide  appropriate 
control . 

Although  the  method  of  conflict  resolution  employed  by  QPA 
is  limited,  it  is  capable  of  handling  the  conflicts  that  occur  in 
the  lamination  process.  However,  the  incorporation  of  numeric 
techniques  and  other  heuristic  methods  of  conflict  resolution 
would  strengthen  the  applicability  of  the  controller  to  other 
processes  in  the  industrial  environment. 


CHAPTER  6 


METHODS  FOR  PERFORMANCE  IMPROVEMENT  IN  QPA 
THROUGH  CONFLICT  RESOLUTION 


6 . 1  Introduction 

The  goal  of  this  research  was  to  improve  the  performance  of 
knowledge-based  process  control  systems  by  enhancing  the  methods 
used  to  resolve  conflicts.  The  approach  was  to  develop  a  coupled 
system  that  would  combine  heuristic  knowledge  about  the  process, 
in  the  form  of  experienced-based  rules,  with  deterministic 
knowledge  about  process,  in  the  form  of  numerical  equations  which 
model  the  process.  The  hypothesis  was  that  a  purely  heuristic- 
based  system  overlooks  the  knowledge  about  the  process  that  is 
provided  by  mathematical  models.  This  additional  knowledge  source 
can  be  used  to  aid  in  the  resolution  of  conflicts  that  arise  in 
the  system. 


6 . 2  Improving  Conflict  Resolution 

Three  methods  of  improving  system  performance  by  changing 
conflict  resolution  include; 

1.  decreasing  the  number  of  times  a  conflict  occurs, 

2.  reducing  the  time  it  takes  to  resolve  a  conflict,  and 

3.  maximizing  the  capability  of  the  system  to  make  the  best 
selection. 

The  first  method  derives  performance  improvements  by 
reducing  the  number  of  calls  to  the  conflict  resolution  system. 
This  is  implemented  by  structuring  the  knowledge  base  to  minimize 
the  number  of  rules  fired  in  a  single  cycle.  This  requires 
identifying  which  rules  might  be  triggered  at  the  same  time.  This 
is  a  difficult  task  for  physical  processes  because  of  the 
interactions  that  exist  between  the  different  variables.  It  is 
difficult  to  predict  the  conditions  (combination  of  variable 
values)  that  will  occur  during  processing.  Once  these  rules  are 
identified,  then  additional  logic  can  be  added  to  the 
preconditions  to  further  justify  their  applicability.  For 
example,  consider  the  two  rules; 

OVERTEMP  Rule; 

IF  (  autoclave-temp  >  (temp-bandwidth  +  set-pt-limit)  ) 
THEN  (  heater  =  hold  ) 

LARGE -GRADIENT  Rule; 

IF  (  temp-gradient  >  gradient-limit  ) 

TkND  (  autoclave-temp  >  low-layer-laminate-terop  ) 

THEN  (  heater  =  decrease  ) 
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It  is  possible  for  these  two  conditions  to  occur  simultaneously 
during  the  lamination  process.  When  both  rules  are  active,  the 
conflict  resolution  system  would  always  select  the  action  to 
decrease  the  heat  supplied  by  the  heater.  Therefore,  the  system 
should  favor  the  LARGE-GRADIENT  .rule.  To  include  this  in  the 
knowledge  base,  additional  logic  should  be  added  to  the  OVERTEMP 
rule: 

OVERTEMP  Rule: 

IF  (  autoclave-temp  >  (temp-bandwidth  +  set-pt-limit)  ) 
ANDNOT(  temp-gradient  >  gradient-limit  ) 

ANDNOT(  autoclave-temp  >  low-layer-laminate-temp  ) 

THEN  (  heater  =  hold  ) 

This  prevents  the  OVERTEMP  rule  from  firing  whenever  the  LARGE- 
GRADIENT  rule  is  active.  Using  this  approach  throughout  the 
knowledge  base  will  result  in  a  decrease  in  the  number  of 
possible  conflicts  that  the  system  will  encounter. 

A  problem  arises  when  using  this  method.  Examination  of  the 
rule  to  determine  what  physical  condition  it  identifies  would  be 
misleading.  The  preconditions  statement  now  contains  statements 
which  identify  the  process  condition  and  which  also  block  the 
rule  at  appropriate  times.  This  results  in  a  knowledge  base  that 
is  cryptic,  confusing,  and  difficult  to  maintain.  To  reduce  the 
confusion,  the  rule  may  be  restated  as: 

OVERTEMP  Rule: 

IF  (  autoclave-temp  >  (temp-bandwidth  +  set-pt-limit)  ) 
ANDNOT(  LARGE-GRADIENT  ) 

THEN  (  heater  =  hold  ) 

This  rule  structure  reduces  some  of  the  confusion,  but  it  also 
requires  that  rules  be  evaluated  by  the  system  using  a  carefully 
prescribed  order.  This  method  does  not  eliminate  the  need  for 
conflict  resolution,  but  only  reduces  the  number  of  times  it  is 
used.  It  changes  the  structure  of  the  knowledge  base,  and  hampers 
its  construction  and  maintenance.  These  problems  make  this  an 
unattractive  solution. 

The  second  method  improves  system  performance  by  reducing 
the  time  required  to  make  a  selection.  This  method  involves 
reprogramming  the  conflict  resolution  system  such  that  it 
executes  more  efficiently.  Improvement  might  also  be  achieved  by 
employing  a  different  strategy  to  obtain  the  same  result,  but  one 
which  is  faster  (possibly  using  less  logic  or  a  better 
elimination  strategy) . 

The  third  method  to  improve  system  performance  requires 
modifying  the  conflict  resolution  strategy  to  improve  its 
capability  to  select  the  best  alternative.  This  method  is  the 
subject  of  the  following  section. 
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6.3  Using  Deterministic  Knowledge  in  Conflict  Resolution 

The  focus  of  this  research  is  to  improve  conflict  resolution 
in  QPA.  The  approach  selected  is  to  incorporate  deterministic 
techniques,  such  as  mathematical  and  simulation  models,  into  the 
conflict  resolution  strategy.  The  intent  is  to  improve  the 
outcome  of  conflict  resolution.  The  following  paragraphs  discuss 
why  this  approach  is  not  encouraging. 

In  Chapter  5  it  was  shown  that  QPA  resolves  conflicts  using 
the  RESOLVES  statement.  This  technique  provides  a  limited  means 
for  resolving  conflicts.  The  RESOLVES  statement  is  static  and 
conservative,  in  that  it  always  selects  the  alternative  that  is 
the  safest,  independent  of  other  circumstances  or  interactions. 

Research  revealed  that  most  of  the  conflicts  handled  by  the 
RESOLVES  statement  are  of  the  "episode/single-problem''  type.  For 
this  type  of  conflict,  independent  of  any  resolution  strategy, 
the  correct  decision  is  to*  select  the  EFFECT  specified  by  the 
process  problem  (see  discussion  Chapter  5)  .  Therefore,  no  real 
conflict  exist.  For  the  other  conflict  types  mentioned  in  Chapter 
5,  the  system  selects  from  among  alternative  EFFECTS  that  are 
issued  by  active  process  instances  (not  including  the  episode 
goal).  For  these  cases,  the  conflict  set  will  contain  only  two, 
or  three,  alternative  actions  from  which  to  choose.  It  is  these 
types  of  conflicts  that  the  system  must  be  capable  of  resolving. 

The  proposed  method  for  resolving  these  conflicts  involves 
three  steps.  As  each  step  was  explored,  it  was  found  to  be  either 
not  necessary  or  too  complicated  to  be  of  any  real  use.  The  task 
of  the  first  step  is  to  verify  the  existence  of  the  process  state 
represented  by  each  alternative  within  the  conflict  set.  Recall 
that  each  of  these  alternatives  represents  actions  that  are 
necessary  to  eliminate  some  process  problem  (accelerated 
reaction,  large  temperature  gradients,  etc.),  or  to  produce  some 
desired  effect  (provide  compaction  to  the  laminate  when  the  resin 
begins  to  gel) .  It  was  believed  that  some  of  these  process  states 
could  be  verified  using  quantitative  calculations  involving  the 
process  equations  (deterministic  knowledge) .  This  verification 
step  would  eliminate  any  alternatives  that  the  process 
calculations  did  not  support.  However,  this  requires  that  the 
user  place  a  greater  confidence  in  the  deterministic  knowledge 
than  in  the  heuristic  knowledge.  This  requirement  may  not  always 
be  true.  Furthermore,  if  deterministic  knowledge  is  available  to 
accurately  verify  the  process  states,  then  it  would  be  more 
efficient  to  use  it  as  the  object-level  knowledge  in  the 
reasoning  of  the  system.  Thus,  this  is  not  a  viable  first  step. 

The  second  step  involves  using  a  process  model  to  make  a 
simulation  run  for  each  alternative  in  the  conflict  set.  This 
produces  a  process  response  for  each  alternative.  The  system  then 
examines  the  process  responses  and  selects  the  alternative  that 
produced  the  best  response. 

This  step  has  several  weaknesses.  First,  there  is  a  question 
of  the  accuracy  of  the  model.  This  accuracy  will  be  dependent  on 
the  operating  region  under  consideration  and  the  process 
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variables  of  interest.  Second,  the  development  of  a  simulation 
model  is  a  difficult  task.  It  is  not  always  possible  to  include 
in  the  model  all  the  details  of  the  process.  This  reduces  the 
breadth  of  the  application,  thereby  requiring  that  an  alternative 
conflict  resolution  strategy  be  provided  for  those  situation 
where  the  model  is  not  appropriate.  Last  of  all,  in  order  for  the 
system  to  pick  the  best  alternative,  it  would  have  to  possess  the 
capability  to  rate  each  response  dependent  on  the  current 
episode,  and  then  perform  a  comparison  of  these  ratings  so  a 
selection  could  be  made. 

The  complexity  and  difficulty  of  this  step  reduces  its 
usefulness  as  an  appropriate  means  for  conflict  resolution. 
Because  of  the  complexity  and  intensity  of  the  calculations,  it 
is  likely  that  this  method  would  degrade,  rather  than  enhance, 
the  system's  performance.  It  is  for  these  reasons  that  the 
proposed  research  objective  was  not  achieved.  Although  it  seems 
simple  to  derive  these  conclusions,  it  is  not  until  one  begins  to 
attempt  these  tasks  that  the  faults  within  them  appear. 


6 . 4  Other  Methods  for  Possible  Performance  Improvements 

Even  though  the  idea  of  using  deterministic  knowledge  to  aid 
in  conflict  resolution  is  not  viable,  it  does  not  mean  that  the 
same  knowledge  can't  be  used  to  enhance  the  accuracy  of  the 
reasoning  process  of  the  knowledge-based  program  itself.  In  this 
section,  several  other  methods  are  briefly  discussed  that  could 
provide  improvements  in  the  performance  of  the  QPA  system. 


6.4.1  Prediction  of  Process  Variable  Values 

The  first  beneficial  use  of  deterministic  knowledge  would  be 
to  predict  the  values  of  process  variables  that  the  sensory 
readings  do  not  provide  directly.  These  predictions  would  be 
obtained  by  quantitative  calculations  using  the  sensory  data 
provided  by  the  system.  These  additional  process  variables  could 
provide  a  better  understanding  of  the  process  and  allow  for 
simpler  construction  of  the  control  heuristics.  For  example,  in 
the  current  QPA  system  th<  viscosity  of  the  system  is  predicted 
using  the  dielectric  sensor  reading.  Using  process  equations  it 
might  be  possible  to  use  this  dielectric  sensor  reading  (e2) 
along  with  other  sensory  or  calculated  variable  values  to  obtain 
a  viscosity  value.  Then,  instead  of  using  the  e2  value  in  the 
rules  of  the  system,  the  "viscosity"  value  could  be  used.  This 
would  make  the  rules  easier  to  understand  and  interpret,  since 
one  has  a  better  understanding  of  the  relationships  of  other 
variables  with  viscosity  than  with  e2  values. 

This  type  of  calculation  is  currently  provided  in  automatic 
control  systems  by  the  use  of  computing  relays.  However,  these 
computational  relays  are  limited  in  that  they  can  only  perform 
rudimentary  mathematical  functions  (e.g.,  add,  subtract. 
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multiply,  divide,  compare,  etc.}*  In  a  knowledge-based  program 
the  calculations  would  be  limited  only  by  the  time  constraints  of 
the  process,  or  the  hardware  and  programming  constraints  of  the 
system.  In  a  knowledge-based  program  it  would  be  possible  to 
perform  complex  computations,  such  as  numerical  integration, 
optimization,  etc.  The  results  of  the  these  calculations  would  be 
available  for  use  within  the  knowledge  base  of  the  system. 

In  QPA,  this  type  of  system  would  provide  for  the  best  of 
both  worlds,  qualitative  and  quantitative  descriptions  of  the 
process.  To  a  limited  extent,  this  capability  is  provided  in  the 
current  implementation.  QPA  currently  performs  calculations  using 
the  sensory  readings  to  obtain  qualitative  terms,  such  as 
derivatives  and  averages.  These  calculations  are  limited  to 
simple  manipulations  involving  addition,  subtraction, 
multiplication,  and  division.  This  limitation  is  due  to  the 
restrictions  of  the  hardware  and  software  of  the  current 
implementation  and  will  be  rectified  in  the  next  release. 
Therefore,  with  the  new  release  of  the  system,  it  would  be 
beneficial  for  QPA  to  incorporate  more  complex  numerical 
capabilities  into  the  system.  These  calculations  would  be 
performed  by  the  blackboard  monitor  and  used  by  the  Parser  and 
Thinker. 


6.4.2  A  Coupled  System  Approach 

One  problem  with  the  method  described  in  the  previous 
section  is  that  it  requires  time  to  perform  calculations.  The 
time  required  might  exceed  that  allowed  by  the  system  in  order  to 
conform  to  the  real-time  response  constraints  set  by  the  process. 
One  method  of  reducing  the  time  would  be  to  off-load  the 
calculations  to  a  separate  processor  and  to  perform  only  those 
calculations  that  the  system  needs.  Within  each  episode  of  QPA 
the  process  is  only  concerned  with  certain  process  instance 
frames.  Therefore,  it  would  be  necessary  to  perform  the 
calculation  of  only  those  process  variables  that  are  used  within 
the  process  instance  frames  specified  by  the  current  episode. 
This  reduces  the  number  of  calculations  that  the  system  performs. 
The  resulting  system  might  be  considered  a  coupled  system. 

A  coupled  system  is  one  which  combines  numeric  and  symbolic 
computing  (see  Figure  6.1).  Even  though  some  of  the  current 
knowledge-based  systems  use  some  form  of  numeric  processing  they 
are  not  necessarily  coupled  systems.  To  be  considered  a  coupled 
system,  the  "system  must  have  some  knowledge  of  the  numeric 
processes  embedded  within  them  and  must  reason  about  the 
application  or  results  of  these  numeric  processes"  [27].  For 
example,  even  though  the  system  described  in  the  previous  section 
uses  both  symbolic  and  numeric  computing,  it  would  not  be  a 
coupled  system.  This  is  because  the  numeric  calculations  are 
perfoxnned  during  every  cycle,  and  the  system  has  no  control  over 
the  application  of  them.  Whereas,  in  the  proposed  system  of  this 
section,  the  system  determines  which  calculations  to  perform 
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Numerical  Computing 

-  Data  processing  &  reduction 

-  Simulation  &  quantitative  models 

-  Statistics  and  probability 

-  Numerical  Algorithms 
Application:  algorithmically  tractable 
problems 


Symbolic  Computing 

-  Data  interpretation 

-  Cognitive  &  qualitative  models 

-  Search  and  heuristics 

-  Non-deterministic  solution  strategies, 
Application:  problems  intractable 
algorithmically 


Figure  6.1:  Computing  Techniques  [27]. 


based  on  the  episode  information  contained  in  the  knowledge  base. 
The  results  of  the  calculations  are  then  used  in  the  heuristics 
to  determine  the  proper  control  action. 

The  resulting  system  would  provide  speed  improvements  unless 
every  calculation  is  needed  in  every  episode.  For  such  a  case, 
the  number  of  calculations  would  be  identical  for  each  episode. 
Specification  of  which  calculations  to  perform  would  be  contained 
with  the  episode  frame  using  a  new  slot. 
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CHAPTER  7 


CONCLUDING  REMARKS 


This  research  investigated  improving  the  performance  of 
knowledge-based  process  control  systems.  The  method  involved 
using  deterministic  knowledge  to  enhance  conflict  resolution 
within  the  system. 

During  the  investigation  it  became  evident  that  conflict 
resolution  has  not  received  much  attention  in  the  literature. 
Based  on  the  articles  reviewed,  the  strategies  of  conflict 
resolution  can  be  categorized  as  either  numeric  or  heuristic. 
Within  the  heuristic  category  six  different  classes  of  strategies 
were  identified.  Each  class  possesses  different  advantages  in 
their  capability  to  order  the  conflict  set.  In  process  control 
applications,  the  numeric  strategies  are  the  favored  technique 
for  conflict  resolution.  But  in  order  to  provide  a  robust  method 
of  conflict  resolution  it  is  necessary  to  combine  both  numeric 
and  heuristic  strategies. 

Examination  of  the  QPA  system  revealed  that  its  technique 
for  conflict  resolution  falls  under  the  "production  order"  class 
of  strategies.  To  make  a  selection.  It  relies  on  the  correct 
placement  of  certain  frames  within  the  knowledge  base,  and  on  the 
use  of  preestablished  priorities  for  rule  effects.  In  order  to 
extend  its  applicability  to  other  processes,  QPA  will  need  to 
make  available  to  the  user  other  heuristic  techniques.  In 
addition,  it  would  be  beneficial  if  QPA  provided  numeric 
techniques  to  provide  for  reasoning  about  the  uncertainty  of  the 
evidence  used  by  the  frame-based  rules.  This  would  provide  a 
physical  link  between  the  resolution  process  and  the 
peculiarities  of  the  process. 

Application  of  deterministic  knowledge  to  resolve  conflicts 
within  QPA  is  not  appropriate.  Attempts  to  use  deterministic 
knowledge  to  resolve  conflicts  results  in  a  partial  redundancy  of 
process  knowledge  and  the  need  to  perform  complex  calculations. 
The  correct  use  of  deterministic  knowledge  would  be  as  an 
additional  source  of  object-level  knowledge,  possibly  within  a 
coupled  system  design. 

Additional  research  is  needed  to  answer  the  following 
questions: 

1.  What  combination  of  conflict  resolution  techniques  is 
best  for  process  control? 

2.  Can  each  resolution  strategy  be  related  to  specific 
characteristics  of  a  process  control  systems? 

3.  What  is  the  appropriate  combination  of  resolution 
techniques  needed  to  handle  each  of  the  possible  types  of 
conflicts  that  arise  (see  Chapter  4)? 

4.  When  is  it  proper  to  use  multiple  levels  of  reasoning? 
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Answers  to  these  questions  are  not  found  in  the  literature.  To 
answer  these  questions,  a  trial  system  will  have  to  be 
constructed  and  demonstrated. 


REFERENCES 


1.  Shortliffe,  E.H.,  Computer-based  Medical  Consultations;  MYCIN. 
Elsevier,  New  York,  1976. 

2.  McDermott,  J.,  "Rl:  A  Rule-based  Configurer  of  Computer 
Systems,”  Artificial  Intelligence,  Vol.  19,  pp.  39-88,  Sept. 
1982. 

3.  McDermott,  J.  and  C.  Forgy,  1978,  "Production  System  Conflict 
Resolution  Strategies".  In  Waterman,  D.  A.,  and  Hayes-Roth,  F. 
(Eds.).  1978.  Pattern-directed  Inference  Systems.  New  York: 
Academic  Press,  pp.  177-199. 

4.  Barr,  A.  and  Feigenbaum,  E.A.,  The  Handbook  of  Artificial 
Intelligence.  William  Kaufmann,  Inc.,  1981,  Vol  1,  p.  197. 


5.  Davis,  R. ,  and  Buchanan,  B.G.,  "Meta-Level  Knowledge:  Overview 
and  Applications,"  Proc.  of  the  5th  Int.  Joint  Conf.  on 
Artificial  Intelligence,  1977,  pp.  920-927. 

6.  Sauers,  R.  and  R.  Walsh,  "On  the  Requirements  of  Future  Expert 
Systems,"  Proc.  of  the  8th  Int.  Joint  Conf.  on  Artificial 
Intelligence,  Vol.  l,  1983,  pp.  110-115. 

7.  Ferrante,  R.D.,  "The  Characteristic  Error  Approach  to  Conflict 
Resolution,"  Int.  Joint  Conf.  on  Artificial  Intelligence,  1985, 
pp.  331-334. 

8.  Tanimoto,  S.L.,  The  Elements  of  Artificial  Intelligence:  An 
Introduction  Using  LISP.  Computer  Science  Press,  1987. 

9.  Charniak,  E.,  and  McDermott,  D. ,  Introduction  to  Artificial 
Intelligence.  Addison-Wesley  Pub.  Co.,  1985. 

10.  Cruise,  A.,  Ennis,  R. ,  et.al.,  "YES/Ll:  Integrating  Rule- 
Based,  Procedural,  and  Real-time  Programming  for  Industrial 
Applications,"  Third  Conf.  on  AI  Applications,  IEEE  1987,  pp. 
134-139. 

11.  Garvey,  T. ,  Lowrance,  J.D.,  and  Fischler,  M.A.,  "An  Inference 
Technique  For  Integrating  Knowledge  From  Disparate  Sources," 
Proceedings  of  the  7th  Int.  Joint  Conf.  on  AI.,  Vancouver, 
British  Columbia,  Canada,  1981. 

12.  "GEST  -  The  Anatomy  of  an  Expert  System  Tool,"  Product 
Literature,  Georgia  Tech  Research  Institute. 


82-58 


13.  Laffey,  T. J. ,  Cox,  P.A. ,  Schmidt,  J.,  Kao,  S.,  and  Read,  J., 
"Real-Time  Knowledge-Based  Systems,"  AI  Magazine,  Spring  1987, 
pp.  27-45. 

14.  Efstathiou,  J.,  "Rule-based  Process  Control  Using  Fuzzy 
Logic,"  In  Approximate  Reasoning  in  Intelligent  Systems.  Decision 
and  Control .  Sanchez  &  Zadeh  (eds.),  Pergamon  Press,  1987,  pp. 
145-158. 

15.  Davis,  R. ,  and  Lenat,  D.B.,  Knowledae-based  Systems  in 
Artificial  Intelligence.  McGraw-Hill  Book  Co.,  1982. 

16.  Baur,  P.S.,  "Development  Tools  Aid  in  Fielding  Expert 
Systems,"  InTech,  April  1987,  pp.  7-15. 

17.  Burg,  B. ,  Foulloy,  L. ,  Heudin,  J.  ,  and  Zavidovique,  B.  , 
"Behavior  Rule  Systems  for  Distributed  Process  Control,"  Proc.  of 
the  2nd  Conf.  on  Artificial  Intelligence  Appl.,  1985,  pp.  198- 
203. 

18.  Gallanti,  M. ,  et.al.,  "Representing  Procedural  Knowledge  in 
Expert  Systems:  An  Application  to  Process  Control,"  Proc.  of  the 
9th  Int  Joint  Conf.  on  Artificial  Intelligence,  pp.  345-352. 

19.  Moore,  R.L.,  and  Hawkinson,  L.B.,  "Progress  Report  on  Expert 
System  Installations,"  Proc.  of  ISA  1985,  pp.  549-552. 

20.  Rao,  M. ,  Jiang,  T.-S.,  and  Tsai,  J. ,  "Integrated  Environment 
for  Intelligent  Control,"  Proc.  of  the  1st  Int.  Conf.  on  lEA/AIE 
1988,  pp.  466-474. 

21.  Wright,  M.L.,  Green,  M.W. ,  Fiegl,  G.,  and  Cross,  P.F.,  "An 
Expert  System  for  Real-Time  Control,"  IEEE  Software,  March  1986, 
pp.  16-24. 

22.  Garrett,  P.  ,  Lee,  C.W.,  and  LeClair,  S.R.,  "Qualitative 
Process  Automation  .VS.  Quantitative  Process  Control,"  Proc.  of 
American  Control  Conf.,  1987. 

2 3  .  Bobr ow ,  D . G . ,  Qualitative  Reasoning  About  Physical  Systems , 
The  MIT  Press,  Cambridge,  Mass,  1985. 

24.  LeClair,  S.R.,  "SENSOR  FUSION:  The  Application  of  Artificial 
Intelligence  Technology  to  Process  Control,"  86  Rochester  Forth 
Conf.,  June  1986,  pp.  15-22. 

25.  Lee,  C.W.,  "Composite  Cure  Process  Control  by  Expert 
Systems,"  First  Conf.  on  Composite  Materials,  American  Society 
for  Composites,  Dayton,  OH,  Oct.  7-9,  1986. 


82-59 


26.  Matejka,  R.F.,  and  Lagnese,  T.J.,  ”A  Representational 
Language  for  Qualitative  Process  Control,”  Proc.  of  the  First 
Int.  Conf.  on  Industrial  and  Engineering  Applications  of 
Artificial  Intelligence  and  Expert  Systems,  1988,  pp.  475-482. 

27.  Kitzmiller,  C.  T. ,  Kowalik,  J.  S.,  "Coupling  Symbolic  and 
Numeric  Computing  in  Knowledge-Based  Systems,”  AI  Magazine, 
Summer,  1987,  pp.  85-90. 


82-60 


